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Preface 



In recent years nanoelectronics has been rapidly gaining in importance and 
is already on the way to continuing the outstanding success of microelec- 
tronics. While most literature dealing with nanoelectronics is concerned with 
technology and devices, it has been nearly impossible to find anything about 
the circuit and system level. The challenges of nanoelectronics, however, are 
evident not only in the manufacture of tiny structures and sophisticated nano- 
devices, but also in the development of innovative system architectures that 
will have to orchestrate billions of devices inside future gadgets. This book’s 
objective is to bridge that gap. 

The topic of this book has actually been offered as a student’s lecture at 
the University of Dortmund. Since the lecture has been held at the faculty of 
engineering, the main focus lies on electronics and on the basic principles of 
the essential physical phenomena. 

This book represents an introduction to nanoelectronics, as well as giv- 
ing an overview of several different technologies and covering all aspects 
from technology to system design. On the system level, various architectures 
are presented and important system features - namely scalability, processing 
power and reliability - are discussed. A. variety of different technologies are 
presented which include molecular, quantum electronic, resonant tunnelling, 
single-electron and superconducting devices and even devices for DNA and 
quantum computing. Additionally, the book encompasses a comparison be- 
tween nanoelectronics and the present state of silicon technologies, a discus- 
sion of the nanoelectronic limits, and a vision of future nanosy stems in terms 
of information technologies. 

This book is intended for those people who have not lost sight of the sys- 
tem as a whole. It not only covers nano-technology and its devices, but also 
considers the system and circuit level perspective, indicating the applications 
and conceivable products. The overview is written for electronic engineers, 
computer scientists, marketing people and physicists, and offers an initial ori- 
entation for students, beginners and outsiders alike. 
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On the Way to Nanoelectronics 



The experience of the past shows that throughout constant technology im- 
provement electronics has become more reliable, faster, more powerful, and 
less expensive by reducing the dimensions of integrated circuits. These ad- 
vantages are the driver for the development of modern microelectronics. The 
long-term goal of this development will lead to nanoelectronics. The first mi- 
croelectronic components and systems were quite expensive and therefore only 
an adequate solution for space travel. Nowadays integrated circuits as key 
components, are utilized in a broad range of applications. The semiconductor 
silicon is the most important material in the production of microelectronic 
circuits. Actually the limit of silicon technology is set by the manufacturing 
processes and not by silicon itself or the laws of physics. 

In the beginning of microelectronics in 1960 the way of its development was 
not clear. It was absolutely not obvious that the circuits must be integrated 
into silicon, however, it was quite obvious to use solid state switches. The 
invention of the bipolar transistor (Bardeen, Brittain, and Shockley in 1948) 
was only possible through extensive studies in the field of solid-state materials 
some decades before. This pioneering work was awarded the Nobel prize. It was 
crucial that both the Bell Telephone Company and the inventors shared their 
knowledge with the public and the further development was not restricted by 
patents. The invention of the integrated circuit by Kilby and Noice in 1959 was 
also spectacular, however, it was quite obvious to integrate both transistors 
and resistors on a single chip. For this work Kilby was awarded the 2000 Nobel 
prize. As we know today his idea was the right approach for the development 
of microelectronic circuits. Other ideas were less spectacular, however, they 
also influenced the spread of microelectronics. Some examples of these ideas 
are the planar technology, the microprocessor concept (Hoff 1971 with the 
Intel 4004), the scaling of MOS circuits (Dennard 1973) and the technology- 
invariant interface of the design rules (Mead and Conway 1980) [1]. 

The significance of microelectronics as a basic technology was hesitatingly 
accepted by the market. Nowadays we recognize microelectronics as an impor- 
tant key technology for present and future information systems. In particular 
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Devices 




Year 



Fig. 1 . 1 . Integration level of memory chips as the highest integrated circuits. In 
the past the integration level doubled every 18 months. The prediction by J. Meindl 
in 1980 was superseded by the curve of 1993, the development moves faster than 
expected 



the growing demands of information technology for more powerful microelec- 
tronic circuits will enforce the transition to nanoelectronics in the future. 



1.1 The Development of Microelectronics 

The progress in microelectronics is indicated by the integration level over the 
last decades of development. Figure 1.1 shows a curve that J. Meindl fore- 
casted in 1980 [2], Gordon Moore, one of the founders of Intel, proposed this 
representation. The Moore Plot shows that the integration level is doubling 
every 18 months. Figure 1.1 shows two tendencies, both are too reserved about 
the real development, so the curves have to be corrected to higher values [3]. 
Today we have the vision of memory chips made of silicon with a capacity of 
160Gbit (4Gbit can store a whole movie). 

Such trend curves of microelectronics show the essential characteristics of 
the devices, systems, or other parameters and can be extrapolated for future 
predictions. These “laws” are not laws of physics, they are mainly based on the 
laws of business management. They are valid because many people are working 
under the same economical conditions and technical prerequisites, and under 
the same social stability. Under these conditions progress is continuous!. In 
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Atoms per bit of information Ener 9y P er lo 9 ical operation 




Fig. 1 . 2 . The development of integrated circuits shows changes in the orders of 
magnitude. The number of atoms and the switching energy necessary for operating 
one bit are reduced by several powers of ten 



addition, the above-mentioned is only valid under the precondition that the 
scientific laws allow such a progress. At the moment the limits set by the laws 
of nature are relatively still far away, as will be shown in Chap. 15. 

Since the invention of integrated circuits w r e can see changes of their char- 
acteristics in the orders of magnitude (Fig. 1.2). If we regard the number of 
atoms that are necessary for storing or operating 1 bit then we can see the 
breath-taking development: In. 1960 one transistor consisted of 10 20 atoms in. a 
volume of 0.1 cm 3 , in the year 2000 these numbers were reduced to 10 7 atoms 
in 0.01 /irn 3 . In the same way the energy for storing or operating 1 bit de- 
creased, since the energy for charging and discharging capacities was lowered 
by the facts that one reduces the area of the capacitors from 1 cm 2 to 0.01 /un 2 
and the voltages from 10 V to 1 V. The change of 12 orders of magnitude was 
the prerequisite that microelectronics could have such high integration levels 
without running into the problems of power dissipation and thermal heating. 

The increase of the integration level was possible first of all by reducing the 
feature size of the devices and in the second place by enlarging the chip area 
and by functional integration (Fig. 1.3). It is assumed that the silicon technol- 
ogy has its limits for further miniaturization to approximately 10 run in 2010. 
To overcome this restriction, functional integration and three-dimensional in- 
tegration are possible solutions. An alternative approach is nanoelectronics. 

Nanoelectronics will offer integration levels that will be about two to four 
orders higher than those of microelectronics by 2010. Therefore we can expect 
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Fig. 1.3. Increase of the integration level over the years via three ways: smaller 
structures, larger chip areas, and functional design 



memories with capacities up to one Terabit. Nanoelectronics is characterized 
by very small structures and by very high complexities. The following chapters 
will discuss both, features. 



1.2 The Region of Nanostructures 

Figure 1.4 shows different physical levels if the feature sizes of the devices are 
scaled down. The characteristic times in semiconductors are plotted against 
the structural dimensions of the devices. These times are correlated to different 
physical effects. The shaded area covers present-day microelectronics. Outside 
this region the domain of nanoelectronics and molecular electronics is reached. 
The characteristic times and dimensions are less than 1 ps and less than 10 
nm respectively. 

For example, the depletion-layer width limits the reduction in size of the 
pn-junction in a diode [4], For a p + n-junction the depletion width can be 
approximated by 



/ 



n 



2 e (Vp — V ) 

Q N d 



( 1 . 1 ) 



The width l n of the depletion layer increases with the voltage V and decreases 
with the doping density Njj of the region. The permittivity is denoted bv £ 
and q is the elementary charge. Another important dimension is the Debye 
length 
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Fig. 1.4. Characteristic times and structures for semiconductor devices. The grey 
area concerns the present integrated circuits 



Ld = J (1-2) 

V ( l n i 

The definition of the Debye length is similar to that of the depletion-layer 
width. In (1.1) the voltage is replaced by the thermal voltage Vt = ~~~ and 
the doping density Njj by the intrinsic doping n % of a semiconductor. For a 
doped semiconductor rii is substituted by the density of the majority carriers. 
The Debye length describes the spatial extension of a perturbation inside a 
semiconductor. 

Particles moving due to thermal energy have a mean free path l between 
two collisions 

i=vthJVj. (1.3) 

Equation (1.3) depends on the thermal velocity vth an< l on the mean free time 

of the particles, which determines the mobility jli of the charge carriers. 
Quantum effects become relevant in devices if the wavelength A of the 
electrons is in the range of the feature size of the devices. 



Equation (1.4) h is Planck’s constant. The wavelength A is inversely propor- 
tional to the mass of the particle m e and the velocity v. 

Most of the nanoelectronic devices function in this range so the wave be- 
havior of the electrons has to be considered. If one continues to decrease the 
structural dimension the domain of atoms and molecules is reached. If a na- 
noelectronic device has only one structural dimension in this range w r e call it a 
mesoscopic device (meso means in between). Due to the disturbing influence 
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of the thermal energy many quantum devices are operated at low tempera- 
tures. However, these quantum effects increase with decreasing feature size 
of the devices. Therefore the devices must be very small if operated at room 
temperature. The latter is also an important point for nanoelectronics. 



dimension / jum 




Fig. 1 . 5 . The development of the feature size of chips, the limit of MOS transistors 
was predicted as 1995, today we assume the limit at 10 nm 



Figure 1.5 shows the reduction of the structure sizes with regard to the 
year of development for application specific integrated circuits (ASICs) and for 
state-of-the-art technology research samples. The development of the silicon 
technology is summarized by the Semiconductor Industries Association (SIA) 
roadmap, which is widely accepted by experts [5]. Today the roadmap predicts 
a minimum channel length of MOS transistors in the order of 10 nm, some 
years ago it was 30 nm. Below this size quantum electronic devices (QED) 
and mesoscopic devices will start to replace MOS transistors. 

The development of higher integration levels forces two conclusions: First, 
new physical effects, for example quantum effects due to small dimensions, 
replace the classical transistors. Second, the huge number of devices on a 
single chip demands new system concepts due to the reduced efficiency of 
the classic architectures. The architectures must consider fault tolerance, self- 
organization, and they must satisfy a large variety of applications. If silicon 
technology will eventually reach its limits other technologies must be applied, 
unless there is no demand from the market to introduce other technologies 
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with improved characteristics. However, this is unlikely because the demands 
of the information technology will be tremendous. 

Another limit may arise from economics when silicon technology is too 
cost-intensive. It is assumed that in 10 years the sale of chips will be equal 
to the sale of electronics. In this case the chips will be too expensive and not 
economic. If a new technology is more economic than the last one then the 
market will accept the new technology. The experiences gained from the past 
show that small structures mean low costs, so nanoelectronics may be the 
right way from the economic point of view, too. 
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Fig. 1 . 6 . Diversity of technologies of today and the future. Their potential of de- 
velopment is indicated 



Figure 1.6 gives a rough overview of the most important technologies of 
the future. A new technology must be better in performance and less in costs 
if it is to be accepted by the market. Today the choice of a specific technology 
is still open. The following chapters will describe a small selection of possible 
technologies for nanoelectronic circuits, a final conclusion is given in Chap. 16. 



1.3 The Complexity Problem 

In addition to the small structures, in the order of some nanometers, another 
point will be of great interest: the high complexity of integrated nanoelectronic 
systems. Modern microelectronic systems contain up to 100 million devices on 
a single chip. Nanoelectronics will push this number up to 1 billion devices or 
even more. The main problem is not only the large number of devices, but also 
the development time and the time for testing such systems. Another impor- 
tant point of view is the choice of an architecture for an efficient interaction 
between the subsystems. 
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The degree of complexity can be analyzed by combinatorial mathemat- 
ics. Microelectronics distinguishs between an implicit and explicit complexity. 
Memory chips, as an example for regular circuits, are characterized by many 
data bits: they show a high explicit complexity. Examples for an implicit com- 
plexity are microprocessors with their instruction sets, analog circuits with 
their complex behavior, or the wiring of a large circuit. 

The analysis of memory chips gives a starting point for this problem.. Test- 
ing of a memory chip with n memory cells, in which each cell can store 1 bit 
requires 2 n test cycles for reading and writing 0 s and 1 s. A 16 Mbit memory 
chip therefore requires 32 million test cycles. Such simple test patterns are 
not sufficient for testing all possible errors of a memory chip because the cells 
can not be regarded independently of one another. If we intend to examine 
all possible dependencies the test must cover all possible bit patterns that 
can be stored on the chip. A chip with n cells has 2 n different bit patterns, a 
chip with 16 777 216 cells requires about 2 16 777 216 « 10 5 000 000 test cycles. 
Assuming 1 ns for each cycle the testing of the whole chips lasts longer than 
the age of the universe. We can see that time is of major concern even from 
the economic point of view. This problem will rise exponentially with increas- 
ing complexity of the chips. In practice test engineering restricts itself to the 
sensitive cases in regard to the interference between cells. The price to be paid 
for this pragmatic solution is memory chips that can have soft failures. This 
problem can be diminished by special software for error correction. 

A further interesting example for the complexity in microelectronics is the 
wiring of an integrated circuit. The task is to minimize the length of the sig- 
nal or bus lines, a very complex problem. If n denotes the number of points 
or devices that should be connected there are m = nl possibilities for con- 
necting them. Problems of this nature are said to be NP complete problems. 
This problem is more complex than testing of a chip and is well known as the 
traveling salesman problem (TSP). The verification of a possible solution is as 
difficult as the problem itself. Parallel computing with present-day microelec- 
tronics is not capable of finding a solution for this problem for large values of 
n. These problems will force the development of nanoelectronics. 

The last examples show that if n becomes large m becomes very large, for 
simplicity the logarithm is introduced. An example is (1.5) for the definition 
of information content : 

H 0 = 1dm. (1.5) 

The information content Hq defines the level of complexity in bits if we ap- 
ply the logarithm to the base of two. The symbol of this logarithm is Id 
(logarithmus dualis). Similar to thermodynamics that describes the num- 
ber of inicrostates for the distribution of particles in a volume the infor- 
mation content is denoted as negentropy (negative entropy). In the follow- 
ing we give some examples for the negentropy of a different system: A card 
game has Hq = 3.2 x 10 2 bit, a microprocessor composed of 10 000 transis- 
tors Hq — 5.4 x 10 6 bit, a telephone network for 10 millions of participants 
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Hq ------ 2.3 x 10 8 bit. A television picture consists of 10 8 bit. Larger values for H 0 

offer artificial neural networks. A net with 10 4 x 10 4 cells has Hq = 2.9 x 10 9 bit, 
whereas a brain with 10 14 neurons has Hq ----- 4.6 x 10 lD bit if this simple model 

is applied. 

A comparison with entropy in thermodynamics is interesting. The entropy 
is defined by 

S = k • lnra. (1.6) 

In this relation k is Boltzmann’s constant and rn the number of microstates. 
The values that occur in thermodynamics are much larger than those of infor- 
mation technology. If we heat 1 cm 3 of silicon from 20° C to 21° C and apply 
(1.6) we get S — 3.76 JK " 1 or applying (1.5) we get Hq — 3.9 x 10 23 bit. 
Such large values are important to everyday life because mean values, e.g. 
the temperature varies between the mean value very precisely. On the other 
hand, information processing is capable of optimizing the thermal energy in 
our houses because the energy of a microprocessor is negligible compared to 
the energy of the controlled system. 



1.4 The Challenge initiated by Nanoelectronics 

The quantity of information is continuously increasing. The exponential 
growth of the data and knowledge is based on new scientific discoveries and 
particularly on the public sector. Many of our technical systems become more 
and more complex, e.g. systems for environmental protection will be widely 
used in future. These systems demand components with high-performance in- 
formation processing. In general, a system is defined as an assemblage or a 
combination of things or parts forming a complex or unitary whole. In the 
following we will concentrate on the architecture, the implementation, and 
the realization of integrated systems for all forms of information processing. 
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Fig. 1.7. Functional diagram of brilliant systems for PCs 
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1 On the Way to Nanoelectronics 



An interesting example for an innovative product is a brilliant system that 
has been launched on the market in a very simple version. Such systems offer 
the following features (Fig. 1.7): Communication with the Internet, infotain- 
ment can help to understand difficult contents, e.g. mathematical problems. 
For daity work, hypermedia, multimedia, and team-working are important 
characteristics. Additional topics are three-dimensional presentations, virtual 
realities, and knowledge filters. The concept of a silent servant working in the 
background is capable of collecting and storing relevant data, and protecting 
the system against misuse. Intelligent firewall systems prevent the systems 
from viruses. 



Physical 

Experiments 




Computational 

Experiments 




Product 1 
Product 2 

Product n 



A Better Way 

Fig. 1.8. The goals of nanoelectronics should be tackled not only from the side of 
technology but also from system engineering. The best strategies meet in the middle, 
and the goal is product-oriented development 



The continuous increase in performance for data-processing tasks is the 
main driver for microelectronic systems. In the past the performance of a 
system expressed in terms of the measure MIPS (million instructions per sec- 
ond) has increased every 10 years by a factor of ten. This increase is based 
on a continuous improvement of the technology in microelectronics. A dif- 
ferent approach for increasing the performance is based on the choice of the 
architecture. Parallel processing increases the performance of data processing 
without changing the technology. This concept does not fulfil all claims. New 
concepts such as, for example, artificial neural networks with the important 
characteristic of selforganization of its structures seem to be very promising. 

The experiences gained in microelectronics show that the development of 
new technologies is not only a matter of material science but also of product 
engineering. Therefore the development of new systems is a joint task between 
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technology development and system engineering, as Fig. 1.8 indicates. The 
development includes the concepts of top-down and bottom-up. 



1.5 Summary 

Today the driving force for microelectronics is information technology. The 
challenges of the future will demand sophisticated microelectronics, probably 
nanoelectronics. The development of reasonable concepts is a challenge for 
engineering and computer science, and an interdisciplinary task, probably for 
a new field that may be called information technology. 
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Potentials of Silicon Technology 



Nanoelectronics has to he discussed in the background of today’s microelec- 
tronic capabilities. Silicon is the predominant material of present microelec- 
tronic technologies, since it has very suitable integration properties. How- 
ever, other materials such as GaAs have even better properties. Among other 
things, the already gathered production knowledge of silicon-based technolo- 
gies represents a huge capital expenditure that still has to be found for other 
technologies. Thus, alternative technologies are interesting and provide im- 
portant information for the further development of microelectronics, but at 
present they do not show any economic importance in comparison with silicon 
technology. 



2.1 Semiconductor as Base Material 

2 . 1.1 Band Diagram of a Semiconductor 

The most important properties of semiconductors derive from the quantum- 
mechanical models (see Chap. 3). The physical shape of a body, but also its 
inside structure has an impact on the electron wave. The band diagram derives 
from the nanometric crystal, structure of the semiconductor. 

Within the crystal electrons are ruled by a periodic potential (Fig. 2.1a). 
It can be considered as periodic lined-up potential wells that are separated 
from each other by potential barriers. In this case, the W function can be 
described by Schrodinger’s equation that shows allowed energy bands as well 
as bandgaps in the E(k)-diagram. Figure 2.1b shows the E(x)-diagram, which 
is more appropriate for practical purposes, since x refers to the x-dimension 
of the semiconductor. Because of the quantum wells, electrons can only take 
discrete energy levels. The potential wells are coupled with each other via the 
tunnel effect that results in a widening of the discrete energy levels. They form 
energy bands (Chap. 3). 
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a) 



b) 



Fig. 2 . 1 . (a) Periodic potential of an undoped crystal (nanometric domain) and 
its wave function, (b) The band diagram explains the behavior of electrons in the 
semiconductor 



The E(x) -diagram comprises allowed energy bands that are separated by 
bandgaps. In terms of semiconductors, the uppermost allowed energy band 
is called the conduction band whereas the second uppermost allowed energy 
band is called the valence band. The conduction band comprises electrons as 
mobile charges that originate from the valence band; these defect electrons 
(holes) are modeled as positive charges of the valence band. The electron and 
hole density can be influenced via doping. 

What is new about this is that the semiconductor is based on two types of 
charge, namely negative and positive (only virtual) charge. Both types coexist 
and contribute simultaneously to the conduction process. This behavior is 
of fundamental relevance in terms of inhomogeneous structures like the pn- 
j unction. 

An electric field E puts the force F = qE on holes because of their elemen- 
tary positive charge q. Due to thermal agitation, particles move slowly with 
the electric field. The mean effective velocity Vf p of the particles is propor- 
tional to the electric field E: 



v Fp = jipE. (2.1) 

lip describes the mobility of the holes. This velocity is responsible within the 
semiconductor for both the charge density p and the current density Sf p - 

Sfp = pqHpE. ( 2 . 2 ) 

Since the electric field E is proportional to the applied voltage, (2.2) is 
equivalent to Ohm’s law: The current density rises with the voltage in a linear 
manner. According to a relative simple model, the mobility fi p depends on the 
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Fig. 2.2. Drift velocity as a function of the electric field. High electric fields cause 
a velocity saturation 



effective mass m* and on the time tc that describes the mean time between 
two collisions with other particles or atoms of the crystal lattice: 



___ VFp ___ qrc_ 
^ p E m* 



(2.3) 



The mean free path is equivalent to rcvth- In this context vth describes the 
mean thermal velocity of the particles, m* as well as tc have to be extracted 
from the band diagram of the corresponding crystal. 

The simple linear equation (2.2) only holds for relatively low electric fields. 
The interaction with the crystal lattice via collisions causes, for higher electric 
fields, a saturation of the drift velocity (Fig. 2.2). This saturation effect results 
in a substantial limitation for electronic systems in terms of their switching 



2.1.2 Band Diagrams of Inhomogeneous Semiconductor Structures 

Figure 2.3 illustrates different inhomogeneous semiconductor structures and 
their corresponding energy-band model [6]. 

The starting point is the undoped semiconductor (a). Its Fermi level is 
approximately located in the middle of the bandgap. Doping shifts the Fermi 
level towards the band edges of the bandgap, which results in an n-type and 
a p-type semiconductor (b). Since the pn-junction (c) is an essential inhomo- 
geneous structure, it appears in almost all semiconductor devices. The pn- 
junction degenerates to a tunneling diode if the doping levels are very high. 
The MOS structure (d) is based on an electric field that is applied vertically 
to the semiconductor surface. The potential well at the semiconductor sur- 
face can store charged particles. The bent band edges in the band diagram 
(d) reveal this potential well. The MOS transistor exploits this effect. The 
Schottky contact (e) is based on a metal-semiconductor junction and is uti- 
lized as contact and diode. Besides the potential shift the Schottky contact is 
also based on band bending. The metal-semiconductor-field-effect transistor 
(MESFET) (f) makes use of the Schottky contact. Further modifications of 
the band structure result from very thin and distinct doped semiconductor 
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Fig. 2.3. Energy-band models of different semiconductor structures as they appear 
in devices: (a) semiconductor, (b) n-doped semiconductor, (c) pn-junction, (d) MOS 
structure with inversion layer, (e) Schottky contact, (f) MESFET, (g) quantum well, 
(h) quantum barrier, (i) superlattice 



layers. For instance, quantum wells (g) as well as quantum barriers (h) can 
be tailored, which are of fundamental relevance for nanoelectronics. Periodic 
layer structures form the so-called superlattice (i) . Within distinct boundaries 
the band structure of the superlattice can be freely modified. 

Most of the mentioned plain structures appear in the next chapter. First 
of all the classic transistors will be analyzed in terms of their minimum di- 
mensions. 



2.2 Technologies 

As an overview, some results of the present semiconductor technology will 
be presented. In this context the Semiconductor Industries Association (SIA) 
takes a closer look at this very active area [5]. 
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2.2.1 Different Types of Transistor Integration 

The minimum feature size of silicon-based transistors is an important ques- 
tion for the further development of microelectronics. On the basis of recent 
investigation results, Fig. 2.4a reveals a first-order approximation of today’s 
integration limits. Distinct spacer layers smooth the junction between gate 
and drain area, which prevents an electric breakdown. Gate lengths of below 
70 nm and gate oxide thicknesses of about 4 nm are already attainable. Such 
MOS transistors operate at the limits of the tunneling effect and the Debye 
length. They occupy less than 0.1 /im 2 . These transistors can still be improved 
towards gate lengths of about 20 nm and gate oxide thicknesses of 0.8 nm [7]. 




Fig. 2.4. Today’s minimum feature sizes: 
sistor. 



b) 

E B C 




(a) MOS transistor and (b) bipolar tran- 



Due to the more complex structure of the bipolar transistor, it consumes 
more area. Nevertheless, passive insulation and self- aligned base and emitter 
contacts lead to device areas of roughly 4 nm (Fig. 2.4b). Passive insulations 
and plain surfaces are essential for future integrated circuitries. For instance, 
Si02 layers are ideally suited for a passive insulation. At present, such lay- 
ers are uniformly deposited over the whole wafer. However, in future a local 
deposition via laser activation might be attainable. 

The silicon on insulator (SOI) technique offers an almost ideal insulation of 
the devices and prevents parasitic effects. Therefore, silicon has to be deposited 
on an insulator, which can be achieved on different ways: (1) Oxygen has to 
be deeply implanted into a silicon substrate that is annealed in a second step. 
Thus, the S 1 O 2 layer separates the upper silicon layer from the rest of the 
substrate. (2) The silicon deposition on top of an insulator, e.g. sapphire, also 
leads to the typical SOI structure. However, this technique turned out to be 
too expensive. (3) It is possible to grow or to deposit a poly silicon layer on 
top of an S 1 O 2 layer. Afterward, the poly silicon layer is almost completely 
converted to a monocrystal layer by a melting procedure. The last method 
is very interesting when several active silicon layers are operated in a stack 
arrangement, which leads to a real three-dimensional integration. 
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Nevertheless, three-dimensional circuits become more and more important 
when the single-transistor area can no longer be reduced. In the past, three- 
dimensional integration was not an issue and higher packing densities have 
been realized via smaller minimum feature sizes. In general, passive insulation 
increases the production complexity substantially, but offers the integration 
of several stacked silicon layers. So-called via holes establish the interconnec- 
tion of the several layers. Additionally, three-dimensional integration is very 
interesting for sensor sj^stems . For instance, the sensor is integrated in the 
upper layer, whereas the data processing takes place in the intermediate layer 
and information storage is realized in the lower layer. 



bulk MOSFET 
a) L g >l 00 nm 





SOI/MOSFET dual-gate MOSFET 

b) 50 nm<L g <l 00 nm c) 10 nm<L g <50 nm 



d) Vertical MOSFET 
L g ^100 nm 




Fig. 2.5. Some integration-type MOS transistors with short channel lengths and 
small inverse leakage currents 



For short-channel MOS transistors the high leakage currents present a 
serious problem in dynamic circuits. The cutoff* current can be reduced by 
implementing the transistors on an insulation layer, which can be achieved by 
using SOI technologies. Another technique is the use of a second gate electrode, 
which depletes charge carriers from the channel in cutoff mode. Short channel 
lengths for vertical MOS transistors can also be realized by using a special 
series of production layers. 

Figure 2.6 illustrates how fine lateral structures can already be imple- 
mented by current technologies [8]. A fine structure can be created at the 
edge of an auxiliary deposition layer. For example, a thin edge of poly silicon 
is left behind after etching polysilicon that has been deposited on an auxiliary 
layer. By using this spacer technique MOS transistors with a channel length 
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Fig. 2.6. Production of very fine structures using the spacer technique 



down to 30 nm have already been implemented. Today these structures are 
utilized to study dedicated quantum effects. 



2.2.2 Technological Processes for Microminiaturization 

The concept of technological scaling represents the transition to finer struc- 
tures, thinner layers, more planar surfaces, and to passive insulation. Many 
technological efforts are being made to accomplish these goals and it would 
be impossible to give an complete overview. Therefore we will present some 
examples to illustrate the principles of technology microminiaturization. 

In solid-state microelectronics thin layers are utilized for the vertical struc- 
turing. A. usual structuring process illustrated in Fig. 2.7 is molecular beam 
epitaxy (AIDE). A heated substrate is exposed to molecular beams in a vacuum 
environment. These beams are emitted from so-called effusor cells. Because 
multiple cells can be used in parallel it is possible to achieve nearly every 
mixing ratio of layers and doping profile. The layer thickness is exactly de- 
termined by the opening times of the effusor shutters. Another advantage of 
the MBE is the possibility to produce nearly abrupt GaAs junctions. Such 
junctions are needed for the production of quantum- well structures. Molecu- 
lar beam epitaxy can also be applied locally by masking the substrate with a 
SiO ‘2 layer. This method is utilized to produce resonant tunnel diodes (RTDs) 
and metal gate field effect transistors which will be explained in Chap. 12. 

The horizontal structuring is often carried out by lithography processes, 
whereby short-wave radiation, for example short-wave UV, electron beams, 
X-radiation, and ion beams, are used to produce finer structures. Figure 2.8 
presents a rough overview of the different lithography methods. At first sight 
it is possible to create ever finer structures by using higher-energy radiation. 
But it must be noted that the material defects also increase proportionately. 
In the near future the UV-wave exposure, using sophisticated devices, seems 
to be sufficient to produce feature sizes down to 80 nm. 

Electron-beam direct writing is a usual method to produce fine structures. 
The major disadvantage of this approach is that all individual structures must 
be written one after the other, which consumes a lot of time. That is why 
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by electron-beam writing. In order to avoid contact copy image defects on the 
semiconductor wafer, the X-ray beams should run as parallel as possible. When 
using a normal X-ray tube, the distance between the radiation source and the 
silicon wafer is so small that many image defects occur in the peripheral wafer 
area. This error AB can be simply measured by the following rule: 

AB = s|, (2.4) 

where s represents the mask- wafer distance, B equals the wafer radius, and 
S is the distance to the radiation source. To keep this deviation small, the 
distance S has to be increased as much as possible, which can be achieved by 
extracting X-ray beams from a synchrotron. When using such a synchrotron, 
the distance S can be chosen to be relatively large, for example 10 m, so that 
the error AB is reduced to less than ±10 rim. 

Figure 2.11 presents a comparison between the different lithography meth- 
ods. The highest throughput is still achieved by optical lithography. A higher 
resolution can be attained by using X-ray or electron beams. The single probe 
methods (SPMs), whereby single atoms are manipulated, yields the best re- 
sults. Remarkable results are also produced by a structure printing process, 
the so-called nanoprinting (Fig. 2.9). Because we still encounter a lack of effi- 
cient production methods, these techniques are an important research area of 
nanoelectronics. 

Another method should be mentioned in this context: The scanning tun- 
neling microscope (STM) illustrated in Fig. 2.10 can. be utilized to visualize 
and analyze the fine structures. 

The principle of such a microscope is simple. A movable tip made of metal 
approaches a surface under vacuum conditions, until a tunnel current is rec- 
ognized. The distance to the surface is controlled by this tunnel current. If 
the current is maintained constant, then the surface structure can be deter- 
mined by measuring the tip position. In practice the distance is controlled by 
a piezoelectric element and the measuring tip can also be moved laterally by 
additional voltages, so that very small regions of a material sample can be 
scanned. 

The STM can also be utilized to write physical structures. By taking ap- 
propriate steps, the measurement tip can pick up single atoms and reinsert 
them in another position (atomic force microscope, AFM). In this way com- 
plete lines or characters have been written with atoms, as can be seen in Fig. 
3.8. The final objective of this technology is to deposit atoms at specified 
positions on a surface or in a crystal. 
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Release Further Processing 

Fig. 2.9. Production of fine structures using printing techniques 




Fig. 2.10. Fundamental principle of a scanning tunneling microscope: A measuring 
tip approaches a sample until a specific tunnel current has established. The current, 
and hence the constant distance to the sample, is maintained by an automatic control 
system 
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Fig. 2 . 11 . Comparison of different lithography techniques from the production point 
of view 

2.3 Methods and Limits of Microminiaturization in 
Silicon 

An essential objective of microelectronics is to produce ever-smaller circuit 
switches. Systematic approaches to this problem were already implemented 
very successful in the 1970s. 

2.3.1 Scaling 

The dimensions, voltages, and currents of a switching element are decreased 
in such a way that the electric field strength and thus the maximum stress 
of that device are maintained (constant-field principle). Using this scaling 
method, the dimensions of a field effect transistor are decreased by a constant 
factor a, as illustrated in. Fig. 2.12. If this method is applied to a capacitor, 
then the capacity is decreased by a: 

C = - (2.5) 

If this capacity has to be charged by a specific current I and for a given voltage 
swing V, then the switching time t c equals 
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Fig. 2.12. Scaling: The physical dimensions and the electric quantities are scaled 
down by a factor a. Thus the electric stress inside the transistor, e.g. the electric 
field, remain constant 



V'C' = t c 

V a 



(2.6) 



These considerations can generally be applied to MQS circuits. Because both 
the voltage V and the current I are scaled down by a, the switching time t ! c 
is also reduced by a, which means the circuit gets faster by scaling. 

The power dissipation of a circuit block is proportional to the voltage and 
the current, and is therefore decreased by a 2 . But because the packing density 
is also increased by a 2 , the unit-area power density stays constant. This result 
is very important in view of an integrated circuit’s heat dissipation. This 
method is the only way to prevent heat from becoming the limiting quantity 
of integrated-circuit performance. In practice the supply voltages are often 
not reduced by the given scaling factor, so that the circuit performance is 
further increased, but this methodology may lead to serious heat-dissipation 
problems. 

The power-delay characteristic is a very fundamental form of representa- 
tion. Today often more than those two quantities are utilized to characterize 
a technology. The region 1 of Fig. 2.13 shows the characteristics of some 
solid-state switching elements. This category contains the devices that can be 
implemented today. The integrated circuits may advance in region 2, if the 
necessary technological research and development efforts are made. Region 3 
lies beyond the classical physical limits and will not be reached, unless new 
concepts, for example quantum computing, are applied. 



2.3.2 Milestones of Silicon Technology 

If we compare the physical dimensions of current MOS transistors with the 
minimum feature sizes derived from physical limits, we will see that there is 
still some potential left. For these developments the Semiconductor Industries 
Association (SI A) has defined milestones up to the year 2015, which have been 
published as the so-called “roadmap 1999”[5]. 
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Fig. 2.13. Power-delay characteristics: region 1 contains the quantities of current 
technologies, region 2 is enclosed by technological limits, and region 3 lies beyond 
classical physical limits 



This roadmap is updated every few years. It is interesting that the fore- 
casted quantities often have to be corrected to higher values. A brief outline 
of important quantities of the very detailed 1999s plan is given in Table 2.1. 



Table 2.1. Selected quantities of the SI A 1999 roadmap 



Year 


2000 


2005 


2010 


2015 


Feature size [nm] 


130 


100 


50 


35 


Device density [y-kr] 


0.5 


1.7 


10 


24 


Device density [^jjyy] 


2 


9 


70 


190 


Clock frequency [GHz] 


1.5 


3.5 


10 


13 



The remarkable characteristics that are predicted for the silicon technology 
outperform current circuits by orders of magnitude. This means that future 
information technology systems will also show an equivalent increase in per- 
formance [9, 10]. 
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Fig. 2.14. Characteristic of high-performance silicon microprocessors according to 
the SI A studies from 1999. Nanoelectronics will only become established if they 
outperform classical systems by at least one order of magnitude. In the meantime 
the SIA corrected the given quantities to even higher values 



In Fig. 2.14 the characteristics of such a development are roughly estimated 
from todays point of view. For this special case the product of device density 
and clock frequency is plotted against time. According to our assumption 
the characteristic of nanoelectronics should be found on top of this curve. 
Therefore a numerical example is also given. 

The transition to nanoelectronics will only occur if it offers performance 
and device density increase by at least one order of magnitude, compared to 
standard silicon-based technologies. This transition is expected to be smooth. 
At first some functional blocks will be included in a hybrid architecture, and 
sooner or later the complete system will be implemented in nanoelectronics, 
whereby the peripheral devices will always consist of larger structures, because 
they have to communicate and adapt to our everyday environment. 



2.3.3 Estimation of Technology Limits 

In reality the method of scaling cannot be applied as ideally as we would 
assume from our initial considerations. For example the voltage equivalent of 
thermal energy Vy, which plays an important role in device physics, is not 
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Fig. 2.15. Power-delay characteristic of current silicon technologies as well as more 
advanced SET and RTD technologies, which will be discussed later 



decreased by a, unless we are not applying lower temperatures. The current 
density inside the transmission line can also not be maintained constant, as 
the electric field, and is instead increased by a. Hot electrons occur inside 
the transistors, which alter the electric properties. The RF properties of the 
transmission lines also worsen for smaller dimensions. These microstructure 
effects limit the possibilities of semiconductor electronics scaling. 

When concidering circuit engineering we also have to consider the devia- 
tion of parameters from their ideal values. The variation of transistor param- 
eters will stay constant, or rather increase after a scaling step. The variance 
AVtu of the threshold voltage will not be changed, so that the variance rela- 
tive to the supply voltage will increase: 



AV± n = AVtti 
V' V 



(2.7) 



This parameter deviation, which increases by downscaling of feature sizes, 
may lead to circuit-design problems and has to be captured by the appropriate 
design tools. 
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The physical dimensions and parameters of a minimum transistor struc- 
ture can be derived from simple considerations. Based on the fact that the 
physical limit of microelectronics is given by the quantum-mechanical tun- 
neling effect, the gate oxide thickness has to be at least 3nm, to prevent 
disturbing tunneling currents. The oxide itself may be stressed by a maxi- 
mum. electric field strength of 1~™, if no dielectric breakdown occurs. This 
equals a gate voltage of 0.3 V. The maximum tolerable substrate doping that 
still enables an inversion layer for the given field strength, can be calculated 
from the MOS transistor threshold voltage equation. This results in a doping 
of Na = 10 18 cm~ 3 . That is why there is a search for alternative insulation 
materials, which provide higher values of e r , to reduce the gate oxide field 
strength. 

The minimum distance between the source and the drain area, which 
equals the minimum transistors depletion zone width, for a supply voltage 
of 1 V, can be estimated as 30 nm. Taking a certain safety margin into ac- 
count, we should assume a minimum channel length of L — 50 nm. 

Based upon experience the active transistor area should equal eight times 
the smallest surface element, which is given by 30 nm x 30 nm (the depletion 
zone width). So the area of minimum-sized transistor is 0.72 x 10 4 nm 2 = 
0.72 x 10 10 cm 2 , which corresponds to the predictions of the SI A roadmap. 
This means that about 4 000 minimum- sized transistors could be placed on 
an area that equals the crosssection of a human hair, and 14 billion transistors 
could be placed on 1 cm 2 , if we neglect any kind of wiring. In the conducting 
state of such transistors we could locate approximately 150 electrons inside 
the channel zone, which would be involved in a switching event. 

What about the parameter variations? In the channel zone of the volume 
(50 nm) 3 we would find about 10 7 silicon atoms. In spite of the very small 
transistor dimensions this is a remarkable number. Assuming a doping con- 
centration of N a = 10 18 cm~~ 3 we could only implant 100 doping atoms. If 
we calculate the relative variance by (15.25), then we get approximately 10%. 
From the circuit design point of view this is an acceptable value. 

In the circuits of today’s microelectronics, charge packages of many elec- 
trons are processed. These packages are stored inside potential wells, of which 
the dimensions can be estimated from the Debye length (Chap. 15). Such 
potential wells can be found in semiconductor memories and sensors of video 
cameras. In the following we will estimate the number of electrons that have to 
be stored inside such potential wells, in order to guarantee reliable switching 
events. 

A MOS capacitor with an inversion layer exhibits a gate-channel-capacitance 
of 
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where A equals the capacitor area, t ox represents the electrode spacing, 
and e r is the permittivity of the gate insulator. The energy stored in the 
capacitor for a voltage Vgc equals 

W, = \c GC yhc- ( 2 - 9 ) 

This energy equals the switching energy of a field effect transistor. It is 
remarkable that the energy is not stored in the form of a charge, but inside 
the electric field. The stored charge consists of n electrons, so that the energy 
can be written as: 



W s — - nqVoc • ( 2 . 10 ) 

Considering the limit of that energy, we can assume that the voltage Vgc 
equals a multiple r of the voltage equivalent of thermal energy Vt = ~~ , and 
we get: 



W s — -nrVT. ( 2 . 11 ) 

For a given operating temperature T the switching energy only depends 
on the factor r and the number n of stored electrons. The minimum value of 
r can be derived from the I-V characteristic of a pn-junction and has to be at 
least from 4 to 8 , if the non-linearity of this characteristic is to be exploited. 

In Chap. 9 a minimum switching energy of 50 k T is derived from the 
consideration of errors caused by thermal variations. This means that at least 
from 12 up to 25 electrons must be involved, but these considerations are only 
valid for classical electronics. 

A further shrinking of the physical device dimensions as well as an addi- 
tional reduction of involved electrons make a quantum-mechanical modeling 
mandatory. In fact, microelectronics is already on a smooth transition towards 
nanoelectronics, as Waser shows in his book [11]. 

Exemplary, Table 2.2 reveals a first order approximation of the most rele- 
vant parameters of resonant tunneling devices and how they are effected when 
the RTD side length Lrtd = \/ Art d is scaled by a factor of r. The lower 
limit of the RTD side length is located in the range of Lrtd = 20 — 40 nrn. 

The gate length of the HFET is scaled separately by the factor 7 . In com- 
parison to the scaling law of CMOS the operating voltage of the MOBILE 
can only be slightly reduced, since the operating voltage already is less than 
1.0 V. Therefore the threshold voltage of the enhancement type IlFET-RTD's 
should not be reduced below Vto = 100 mV. The peak voltage of the RTD 
as well as the operating voltage have direct influence on the voltage swing 
AV « 0.9 Vdd & 2.5 Vp and can be reduced by a factor of 77 ------ 2 . . . 3 to 

Vp = 0.1 V. Since the logic 1 -level is approximately at Vh = 0.3V the drain- 
source current of the HFET is relatively small. This might be compensated by 
enlarging the gate witdh of the transistor but this also enlarges the capacitive 
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Table 2.2. Scaling law of the RTD-HFET MOBILE gate (NAND, Fan-in = Fan- 
out =2) 



term 


definition 


unit 


scaling 


basic parameters 
RTD side length 
RTD-HFET gate length 
Peak-current density 
Peak voltage 


LrtD 

L 

jp 

V P 


fim 

nrn 

kA/cm 2 

V 


1/k , K > 1 
l/y , 7 > 1 
a , a > 1 

1 / 77 , 3 > T] > 1 


derived RTD-HFET param. 










Minimum RTD area 


\ min 
A RTD 


— r 2 

— lj RTD 


pm 2 


1/k 2 


Peak current 


Ip = 


jp A RTD 


jiA 


ol/k 2 


Voltage swing 


AV = 


2 - 2.5 V P 


V 


1 h 


Operating voltage 


Vdd, 


\rmax 

V CLK 


V 


1 h 




— 2.5 


- 3 V P 






RTD capacitance 


Crtd 


s~i a min 

— yy RTD ^ RTD 


fF 


1/ K 2 


RTD speed-index 


SI = 


Ip /Crtd 


V/ns 


a 


RTD-HFET gate width 


w 




/xrn 


1/7 


Gate-source capacitance 


Cos = 


= Cos W L 


fF 


1 / 7 2 


der. param. of the MOBILE 










Load capacitance (Fan-out 2) 


C L = 


Crtd + 2 Cos 


fF 


Crtd /k 2 










+ 2 Cos/')' 2 


MOBILE Speed-Index 


SI MO 


b = Ip / 2 Cl 


V/ns 


a 


Intrinsic MOBILE switch, time 


tint — 


AV/SImob 


ns 


I/O v) 


Cycle 


T — 10 tint 


ns 


I/O v) 


Power diss. during switching 


Psw = 


3/5 Ip V P 


/iW 


a/(rj k 2 ) 


Static power diss. ( PVCR — 3) 


Pstat 


— 2/5 Ip Vp 


gW 


a/(ii K 2 ) 


Dynamic power dissipation 


Pdyn 


— Cl Vp y? 


/AV 


Q'/O 7 2 ) 


Power delay product 


Pges t 


int ~ Pdyn tint 


D 


1/(0 7 2 ) 



load of the preceding stage and therefore decreases the maximum operating 
frequency. In contrast to this the circuit performance and the fan-out can be 
controlled by the peak current density of the RTD. Therefore the scaling fac- 
tor a has been introduced. By using different scaling factors for the HFET 
and the RTD a fitting of the different devices to each other becomes practical. 
Since HFET transconductances of about g™ x — 1000 mS/mm are available 
with gate lengths below 200nra, the W / L-ratio can be scaled down in a dis- 
proportionate fashion. For the calculation of the intrinsic switching times in 
Table 2.2 the simplified case k, = j has been assumed. As an important finding 
for the III/V technology one can state that the performance of the HFET- 
RTD MOBILE can be increased by only increasing the peak current density, 
because the lateral scaling factor k has no impact on the speed-index of the 
RTD. 
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a) b) 

Masking Layer (SiCbSqNzt) 




Fig. 2.16. Etching of structures for micromechanics with anisotripic selective etch 
process (b), compared to conventional etching (a) 



2.4 Microelectronic and Mechanical Systems (MEMS) 

An interesting and very promising development to more complex VLSI sys- 
tems is the diversification to microelectronic and mechanical systems (MEMS). 
The integration of mechanical components makes it possible to bring vari- 
ous elements into a system, as actuators and sensors. This will open many 
new areas of applications, as Fig. 2.17 indicates. MEMS will also promote 
nanoelectronics (NEMS), since they can provide the sophisticated interface 
between nanoelectronics and our environment [12]. 

2.4.1 Technology of Micromechanics 

Micromechanics means the realization of mechanical structures the geometric 
dimensions of which are so small that they can not be realized with conven- 
tional precision work. In the past micromechanics used the outstanding char- 
acteristics of the silicon technology. Silicon belongs to the most investigated, 
the most understood, and technologically the most controlled materials, as 
mentioned in the introduction of Chap. 1. The strength of silicon is superior 
to steel so that tiny and complex structures can be manufactured with high 
yield and they operate with high reliability as well. 

The realization of micrornechanical parts uses a special etching technique. 
First, this technique applies an anisotropic selective etching that etches along 
the different crystal axis with different intensities. In this way we get well- 
shaped trenches. Secondly, the etching velocities depend on the doping of the 
layers, so we can realize etch stops easily. The processes use silicon nitride and 
silicon dioxide layers both for masking and electrical isolation. 

The results of this etching technology are schematically shown in Fig. 2.16. 
The anisotropic selective etching yields flat walls that go along the crystal axis, 
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Fig. 2.17. Map for micro- and nanosystems on which micromechanical components 
are integrated together wdth electronics 



in this example for a <1Q0> silicon material. Since the boron-doped layer 
represents an etch stop we get a flat bottom in the trench. The dimensions 
of the trench are defined by the window of silicon dioxide at the top of the 
substrate. 

2.4.2 Micromechanics for Nanoelectronics 

One interesting aspect of the trenches is that they offer an integration of 
various functional parts into one silicon crystal together with integrated cir- 
cuits. In this way different technologies can be brought together as Fig. 2.18 
shows. This hybrid technology combines many various components with small 
structures into one chip with high accuracy. These features are the prereq- 
uisite for nanoelectronics interfaces with conventional microelectronics. This 
technique is in use for the connection of glass fibers to a chip in which the 
fiber is adjusted in a V-shaped grove so a low-loss transition from the fiber 
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Fig. 2.18. Hybrid technology: Various components on one silicon chip [13] 




Fig. 2.19. An element for sweeping a light beam or for projecting light on photo- 
diodes or a screen 

to the light sensors on the chip is feasible. Such structures are also useful for 
high-frequency circuits. 

A structure often used in micromechanics is the floating bar over a trench. 
The bar resists from etching since it is protected by the mask. The bar can 
oscillate and has a resonance frequency. Therefore we can build a frequency 
meter with several bars or an acceleration meter with only one bar, which can 
be applied in an ABS sensor of a car. 

Micromechanics often is the key technology for sensors that we need in 
environmental technology or in the health system. A gas sensor consists of 
a trench with a suspended electrode. The electrode is the gate for the field 
effect transistor, the drain and the source of which are implemented at the 
bottom of the trench. The volume of the trench between the gate and the 
channel region can be filled up with a chemical substance that reacts with 
gas molecules that are to be detected. Since dielectric changes or ionized gas 
molecules influence the threshold voltage of the transistor the structure can 
measure the concentration of the gas. In the gate electrode holes can be pro- 
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Fig. 2.20. Cross section of a gas sensor with a field effect transistor with a suspended 
gate 

vided so that the gas molecules can intrude into the insulator of the trench. 
Such a sensor is a preceding stage of molecular electronics. In this context we 
should mention the microfluidic that is used in biological analysis systems on 
a chip, which are described in Chap. 11. 

As another example Fig. 2.21 shows an interesting actuator that is movable 
in x, y, and z directions by applying a voltage at the piezoelectric layers and 
can scan a volume of about 0.001 /im 3 . This actuator is also realized by the 
technolog}^ of micromechanics. The probe tip in the end of the bar can scan 
the surface of an object as is done at the scanning tunneling microscope. In 
nanoelectronics such a probe tip can write lines or program gates, and it can 
contact circuits, for example in memories like Millepede. 

In these ways micromechanics provides essential methods for nanoelectron- 



2.5 Integrated Optoelectronics 

In Chap. 4 it will be shown that information processing basically can be 
realized bv using photons or electrons, and that electronics are widespread 
because of their well-known advantages. But photonics also offer interesting 
properties: Information coded by light waves can be transmitted very rapidly 
and can be parallel processed easily. Difficulties arise when we consider the 
amplification and storage of information. That is why the research results in 
photonics are often criticized. The following disadvantages are often quoted 
in this context: the lack of amplification, the unsolved fanout problem, the 
missing output separation from the input of circuit elements, and the diffi- 
culties concerning the implementation of bistable elements. But solutions for 




A Piezoelectric Layers 

Fig. 2.21. Microtip for actions in the nanometer range 
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Fig. 2.22. Basic principles of optoelectronic integrated circuits (OEIC): vertical, 
lateral, and planar integration can be distinguished 



these problems could be seen to emerge. A further advantage is the fact that 
optical information processing can be carried out with low power dissipation, 
in integrated circuits that occupy only a small volume. 

Integrated optics are taken as miniaturized optical circuits, in which light 
signals are generated, conducted by waveguides, processed by corresponding 
physical effects, and finally detected by sensors. Special circuit components 
for these functional elements have been developed. It is attempted to integrate 
each of these components monolithically on a single substrate. The subsequent 
step is the attempt to integrate photonic and electronic circuit elements on 
a single chip. These kinds of systems are called optoelectronic integrated cir- 
cuits (OEICs). Different technologies for the implementation of OEICs are 
illustrated in Fig. 2.22. 

GaAs lasers are often used as light sources of integrated optics (Fig. 2.23). 
Because of their small physical dimensions, microlasers are useful components 
of nanoelectronics. Today, semiconductor lasers cannot be implemented in sili- 
con because this material is characterized by an indirect bandgap. By utilizing 
micromechanics the silicon substrate can be prepared for a hybrid technique 
for mounting of a laser diode. Another approach is given by heteroepitaxy, 
whereby GaAs layers are deposited on a silicon surface. There is also work 
progressing to determine if impurity atoms can be inserted in a silicon crystal 
in such a way, that a direct band transition of electrons becomes possible. The 
first promising results, concerning the insertion of carbon atoms, have been 
reported. 

GaAs light-emitting diodes can be used as incoherent light sources. The 
application of light-emitting silicon is also being examined, although the effi- 
ciency factor of that material, as known today, is very low. 

As a first example, an AND gate integrated on a GaAs chip is presented in 
Fig. 2.24a. Two phototransistors, which control a photodiode, are connected 
in series: The photodiode only emits light if an optical signal is applied to 
both inputs A and B. Thereby the input signal is amplified. The schematic 
of Fig. 2.24b illustrates the cross section of an integrated device: the stacked 
layers of the phototransitors and the photodiode lie on top of each other. 
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Integrated optical waveguides, which are used to connect integrated optical 
circuit devices, consist of a layer, with the refractive index 771, conducting the 
wave, and of further surrounding layers with a refractive index 772 smaller than 
rji. Because of the physical dimensions and the refractive index profile, only 
specific wave modes will be carried by the waveguide. 

In conjunction with standard silicon technology, integrated optical waveg- 
uides can be realized by a small extra effort, for example as ribbon waveg- 
uides, or as waveguides buried in the substrate. In the following we will focus 
on waveguide layers made of oxides, which can be easily produced in an ex- 
tended CMOS technology (Fig. 2.25). 

Photonics are particularly interesting for complex connected structures, 
because they enable a high degree of parallel processing, because light waves 
have no need for electric conduction, and because they offer a high immunity 
to electromagnetic disturbance. Light waves are characterized by very high 
frequencies, so that they are very suitable for bulk transmission of data. If we 
consider an optical fiber, we have a bandwidth of 50 000 GHz at our disposal. 
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Fig. 2.23. Semiconductor laser diode and a microlaser. The physical dimensions of 
the microlaser are 7 pm diameter, and 10 pm length 
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Fig. 2.24. Nonlinear elements for optoelectronics: (a) an AND gate consisting of 
phototransitors and a photodiode, and (h) its technological implementation 
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With this huge bandwidth we can feed the inputs of very complex nanoelec- 
tronic chips by way of example. This can be achieved by using parallel light 
rays, or by using a single light wave and a particular modulation procedure. 

An interesting application for nanoelectronics combined with nanotech- 
nology provides the intelligent dust, shown in Fig. 2.26. Such a dust particle 
consists of a complete data-processing system. In this case nanostructures are 
mandatory. In addition, the system combines optoelectronics and power sup- 
plies, for example with a thick- film battery and a solar cell. Intelligent dust 
can help to solve difficult problems such as the efficient movement of the fin- 
gers of a piano player or the optimization of a jet stream. It is an example for 
ubiquitous computing. 



2.6 Conclusion 

Today, silicon is still the most important base material for microelectronics, 
because it is very suitable for integrated circuits and thereby covered by in- 
depth research. By using special configurations of the silicon material, or by 
adding other materials, the limits of the silicon-based circuits have been ex- 
tended ever further. The transition to higher component densities and finer 
physical structures demands new technological steps, which have to treat the 
involved materials very carefully, and that parameters have to vary as little 
as possible. 

Today we know a significant number of technological limits, which seem to 
be almost insuperable. We have to assume that this limits can be extended by 
spending huge scientific efforts. The main research goal in this area is not only 
to avoid the parasitic effect that will occur in the course of this development, 
but also to directly use this effect in circuits and systems. 

From an electronic point of view, it is at first unexpected that microme- 
chanical components can be produced in silicon. But this approach also shows 
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Fig. 2.26. Intelligent dust and its components 



that an unusual course of thinking may lead to interesting innovations. The 
research topic of sensors and actuators will gain economic importance in the 
near future, for example as components in complex systems for environmental 
monitoring. To realize such devices, it will be necessary to implement various 
physical and chemical effects in silicon chips, in order to enable integration- 
related system production. 
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For an introduction to nanoelectronics it will be necessary to use quantum- 
mechanical concepts to explain the operational principles of these very fine 
structured devices. In this chapter the basic principles of quantum theory will 
be presented from a conceptual point of view. Quantum effects set up the 
limit of electronic miniaturization and information processing. They will be 
of great importance, if device dimensions are of the order of the de Broglie 
wavelength [14]. Another kind of quantization may be relevant to information 
processing. Therefore some principles of information theory will be presented 
at the end of this chapter. 



3.1 Some Physical Fundamentals 

3.1.1 Electromagnetic Fields and Photons 

In electrical engineering, electric and magnetic fields, voltages, and cur- 
rents occur. These values as well as electromagnetic waves are described by 
Maxwell’s equations: 



rotH = (3.1) 

f) b 

rotE = (3.2) 

II is the magnetic, E the electric field strength, J corresponds to the current 
density, D represents the dielectric and B the magnetic flux. The electric and 
magnetic fields in physical structures, e.g. in a waveguide, can be calculated by 
using these partial differential equations with the correct boundary conditions. 
Because these equations describe macroscopic electrical effects, they represent 
the basics of classical electrical engineering. The mesh and the nodal rule that 
are used to calculate the voltages V and currents I in an electrial network 
derive from these equations. 
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The electric current / is caused by the motion of charge carriers. Such 
carriers in semiconductors are electrons or holes. In microelectronic switching 
elements, e.g. in transistors, electric fields control the current. 

An interesting nanoelectronic problem is the storage of a charge package 
that is as small as possible on a very small capacitor. Since the amount of 
charge Q is quantized, it consists of n elementary charges q . From Coulomb’s 
law we can write the energy stored in the capacitor as 



W 



1 (nq) 2 

2 C 



(3.3) 



Thus the energy W on the capacitor is increased by minimizing the capacity C . 
This effect can be reached by reducing the capacitor dimensions to the 
nanometer scale. If additional elementary charges q are applied to the ca- 
pacitor by using a tunnel element, the stored energy of the capacitor increases 
abruptly stepwise. The so-called Coulomb blockade is based on this effect. 
Assuming that the potential of a very small capacitor is below the potential 
of a charge source, an electron can only flow from source to the capacitor if 
the potential on the capacitor after the transition is still below the source 
potential. Otherwise the process would not obey the principle of energy bal- 
ance. The single-electron transistor uses this effect, which will be introduced 
in Chap. 13. 

In order to calculate the behavior of a wave, for example the electric field 
distribution E(x, t) on a conduction line, we can derive the following differen- 
tial equation from Maxwell’s laws: 



d 2 E _ 1 d 2 E 
dx 2 c 2 dt 2 



(3.4) 



The electromagnetic fields of light beams (Fig. 3.1) are described by this 
equation as well. The energy flow is characterized by the Poynting vector, 
which follows from the field distributions. In this case the constant c represents 
the speed of light and the wave impedance of vacuum is given by Z G — -A- — 
377 Q. 




photon electromagnetic wave 

Fig. 3 . 1 . Light as a particle flow or an electromagnetic wave 



Light can alternatively be regarded as a flow of energy packets. These 
particles are called photons, both the wave and the particle model coexist. 
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This dualism is expressed by Einstein’s hypothesis that associates the photon 
energy with the frequency of the light wave: 

W = /*•/= y, (3-5) 

where h is the elementary quantum of action (Planck’s constant). By increas- 
ing the frequency / of a light beam, the energy packets W of the photons 
become larger. If the radiation intensity, i.e. the wave amplitude is increased, 
then the number of photons increases likewise. The light wavelength A is in- 
versely proportional to the light frequency /. Concerning the visible light 
spectrum, the frequency / is in the range of a tenth of a micrometer. Quan- 
tum electrodynamics demonstrate the particle model’s use. Feynman shows 
that all physical effects described by the wave theory can also be explained 
by a particle model. 

By nature light is suitable to transmit information. By the actual state 
of knowledge of the speed of light is the fastest possible propagation velocity. 
But the information-processing abilities of light are restricted, because until 
now no switches are known that control light as well as transistors control the 
electron currents. 



3.1.2 Quantization of Action, Charge, and Flux 

Classical physical models assume the continuity of quantities and involve no 
restrictions concerning very small physical structures. The quantum theory 
shows, however, that values of some measurable variables of a system, can 
attain only certain discrete values. The smallest possible jumps in the values 
of those observables are called “quanta”. A quantum is the smallest possible 
unit that has to fit into a nanoelectronic structure. The photons or light- 
quanta were already mentioned above [15]. 

Another physical quantity that does not occur as a continuous variable is 
the value of action II , which is defined as the product of energy and time, 

H = W-t = n H - ft, (3.6) 

thus action is made up of tih action quanta h. Common electrical values are 
the energy W = ™ = Vlt and the power P = y — VI. Other quantized 
values may be derived heuristically: 

1. The product I • t corresponds to the charge Q, which is quantized by 
charge packages of q. This elementary charge q is the smallest possible 
charge quantity. From quantum electrodynamics we can derive the follow- 
ing relationship for the elementary charge 



q 2 — 2ahc€Q 

q = y/2ace o • Vh = 1.60 x 10~ 19 A s, 



(3.7) 
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where a is the fine structure constant given approximately by . The el- 
ementary charge is proportional to the square root of the action quanta h. 
The Coulomb-blockade effect, which was introduced above, is based on the 
quantization of charge. 

2. The product Ut represents the magnetic flux, which is also quantized. If 
we consider the magnetic flux in a current-carrying superconductive ring 
as a function of the current, we notice that this flux changes abruptly 
stepwise. It consists of flux-quanta given by 



#0 = ; 1 Vh = 2.07 X 10" 15 V s. (3.8) 

2^2aceo y ! 



This relationship derives from (3.6) for tih ~ 1 and from the fact that 
two electrons are always contributing a supercurrent. The flux quanta 
is an elemental quantity comparable to the elementary charge q and is 
likewise proportional to the square root of the action quanta h. If we also 
take the wave impedance of vacuum Z 0 = 3771? into account we find the 
following interesting relationship 



q 4a 



(3.9) 



This equation uncovers the relation between the flux quanta, the wave 
impedance, and the elementary charge. It will be useful for the comparison 
of field effect electronics with superconducting electronics, which will be 
drawn in Chap. 14. We will see that the superconducting electronics are 
not well suited for device miniaturization because the action due to the 
flux quanta is much higher than that due to the elementary charge. 



3.1.3 Electrons Behaving as Waves (Schrodinger Equation) 

The model of the dualism of light can be transferred to other particles, e.g. 
electrons, because a physical mass can be associated with the photons energy, 
according to the relation E = me 2 . An electron of mass m e has a negative 
charge of e = —q. Both of these values can be measured experimental^ [16]. 

From a heuristic point of view we can conclude the following: If the electron 
is in motion it shows a kinetic energy of E = |mu 2 . Starting from the classical 
model such an electron would travel along with velocity v and momentum 
p :::::: rnv . (As we focus on the one-dimensional case we can use scalar values 
instead of vector notation.) The action of this process is given by 

H = mv • x, (3.10) 

which should increase continuously with x. But because action is quantized, it 
increases in abrupt steps, which results in an abruptly changing velocity v and 
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also in sudden changes of the electron energy. This contradiction is resolved 
by giving up the classical model of a continuous trajectory and moving on 
to quantum mechanics. According to this theory the electron is described by 
successive quantum-mechanical states, which represent a certain probability 
that the particle may be located in a specific spatial region. 

These measures of probability can. be calculated from the wave function 
W that results from the solutions of a partial differential equation, called the 
Schrodinger equation. 



H 2 d 2 W(x) 
2m dx 2 



+ V(x) • W(x) 



E-W(x), 



(3.11) 



where V(x) is a given potential function and h — yM Considering that V(x) 
is the Coulomb potential of a charged nucleus, e.g. of a hydrogen atom, we 
can calculate the wavefunctions W. also called states, by solving (3.11). This 
wave representation describes the behavior of the hydrogen atom more appro- 
priately than the well-known model postulated by Bohr in the 1930s, which 
involves electrons orbiting the nucleus. 

If the boundary conditions change in time we have to apply the time- 
dependent Schrodinger equation 



h 2 d 2 W(x,t) 
2m dx 2 



+ V (:i;, t) • W(x, t) — jh 



d\P(x, t) 
dt 



( 3 . 12 ) 



The Schrodinger equation is not identical to Maxwell’s equations, although 
it is treated with a very similar mathematical formalism, so engineers should 
have no problernes in applying this equation. 

According to the simple case of V(x) =0 (3.12) is solved by the complex 

wavefunction 

W = A • exp (j[kx — cut ]) . (3.13) 

The wave vector k is often used instead of the wavelength A to characterize 
waves. In contrast to the treatment of alternating currents, where complex 
syntax is only used as an elegant mathematical methodology, the complex 
representation is inevitable in this case. The probability P of finding the par- 
ticle in a specific spatial region is defined by W* - W, or |$j 2 , respectively. If we 
are considering the three-dimensional case we have to perform an integration 
over the volume V 

P = C ■ J <r<2W. ( 3 . 14 ) 

This equation reduces to a line integral over x for the one-dimensional case. 
The normalization constant C has to ensure that the integral over the entire 
possible space results in P = 1, because the probability of finding the particle 
somewhere in this region must be unity. 

This interpretation of \\P\ 2 suggests the introduction the term information. 
The information delivered by a measuring process is inversely proportional to 
the probability of localizing a particle in the observation space. Although 
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this obvious relation to information theory is interesting, the concept was not 
generally adopted by physicists. 

At the present time a direct derivation of Schrodinger’s equation is not 
known. A heuristic one can be deduced from the observation of a parti- 
cles quantized impulse according to its location x. This relation is repre- 
sented by a step function where the probability density W at the discontinuity 
points must vanish. This condition is satisfied by applying a wavefunction like 
Schrodinger’s equation. By using this approach we solve the problem of im- 
pulse quantization but also give up the classical theory of mechanics, where a 
particle moves along a continuous trajectory, as mentioned above. 

The wavelength of W follows directly from the quantization if we assume 
that II — h and x — A in (3.10): 



A = 



h 

m • v ’ 



and for the particle momentum 



P=y (3-15) 

The wavelength A depends on the propagation velocity v, and hence on the 
frequency /, which is often written as v. The higher the momentum p or the 
kinetic energy of the particle, the shorter the wavelength A of the material 
wave. The kinetic energy of the particle is given by 



— — - — k 2 
2 m A 2 2 m 



(3.16) 



For an energy value of 1 eV, which is typical for switching elements, the char- 
acteristic wavelength is 1.2 nm. The dimensions of nanoelectronic devices are 
in the same order of magnitude. 

Another important relation that derives heuristically from the model de- 
scribed above is Heisenberg’s uncertainty principle: The higher the momentum 
p, the shorter the wavelength A, because from (3.15) we get 



A h 

— . mv — — 
2 2 



(3.17) 



This relation is also valid for differences of impulse and location. Assuming 
that ~ = Ax and m • Av = Ap we obtain the uncertainty relation: 



Ax • Ap > 



h 

2 



(3.18) 



Bv extending the expression on the left side we get the alternative formulation 



AE ■ At>^. 



(3.19) 
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The uncertainty principle denotes that the location or the momentum of a 
particle, and its energy or its time of observation can only be determined 
imprecisely. This statement is very important if we are considering nanoelec- 
tronic applications, because the dimensions of such devices are so small that 
we can use the uncertainty principle to roughty estimate the relevant nano- 
electronic effects, for example the tunneling effect. 

In the following sections some important nanoelectronic structures will 
be discussed. Thereby it is inevitable to apply the wave model of matter to 
describe the behavior of the electrons involved. The upcoming example of the 
potential well shows that it is not possible to correctly determine the behavior 
of an electron in such a configuration by using the classical-particle model. 



3.1.4 Electrons in Potential Wells 



Figure 3.2 depicts the simple case of a one-dimensional potential well of length 
L that is enclosed by infinite potential walls. Because the electron is freely 
moving inside the well, its energy is given by 

E = ~—k 2 . (3.20) 

2 to v ' 

However, the particle is reflected at the barrier positions x — 0 and x — L, 
thus stationary waves build up inside the well. A standing wave can only exist 
if the length L of the potential well equals a multiple of ~ . All other possible 
wave instances will vanish due to destructive interferences. The stationary 
waves are described by the solutions of Schrodinger’s equation 

^ = (f) S in 1 ~T7 with n = 0) C 2 (3.21) 



This solution shows that an electron can never be located at the wall, 
because the wavefunctions always vanish at these specific locations. This con- 
dition results in the fact that the particle can only assume discrete energy 
values 



E n 



2 h 2 



1 h 



8nL 2 2 2tt 



TOjJ 1 . 



(3.22) 



This result is generally represented by a so-called band diagram, or energy 
diagram W = f(k ). In this special case the energy band mentioned above 
consists only of discrete values. The number of these energy values can be 
determined by the length L. The smaller the length of the potential well, the 
higher the energy W ni which can also be written as a function of the factor 
n and the fundamental oscillation frequency uji . The discretization of energy 
values plays an important role in nanoelectronics and should not be confused 
with the quantization of charge that leads to the Coulomb blockade described 
above. 
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Fig. 3.2. The discrete energy states of an electron in a potential well. The eigen- 
values lie on a parabolic curve 




Fig. 3.3. A potential well, an important element of nanoelectronics. Illustrated is 
the 1st and 3rd solution of Schrodinger’s equation 



A fundamental structure of nanoelectronics is the potential well enclosed 
by finite walls. Because of the complex boundary conditions we obtain a 
slightly different solution, which is qualitatively presented in Fig. 3.3 [17]. 

An essential aspect of this structure is the fact that the wave amplitude 
inside the potential barrier is not vanishing but exponentially decreasing. 
The same effect can be found by observing quantum-mechanical tunneling, 
which plays an important role in nanoelectronic switching elements, e.g. tun- 
nel diodes. 



3.1.5 Photons interacting with Electrons in Solids 

The fundamental physical effect of most optoelectronic devices is the interac- 
tion between photons and electrons in solid-state elements. Thereby energy 
is taken up by absorbing a photon and generating a pair of charge carriers, 
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Fig. 3.4. Energy-band model illustrating the interaction of electrons and photons 
in crystal structures 



namely an electron and a hole. By looking at the energy-band model we can 
illustrate this process by an electron that is elevated from the valence band 
to the conduction band (Fig. 3.4). The energy that is consumed for this tran- 
sition derives from (3.5). The action converted by the switching operation is 
given by 

W s t s ----- nh , (3.23) 

which is proportional to the number of quanta. If we consider the case of n = 1 
this leads to the minimum amount of energy that must occur at least at every 
switching process: 

w s >~. (3.24) 

t s 

If we substitute the switching time t 8 by the reciprocal of the frequency / we 
obtain the well-known relation between energy and frequency (3.5). Because 
the wavelength A is inversely proportional to the frequency /, the switching 
frequency is raised if we reduce the wavelength and thus the switching energy 
is increased. This is important information that shows that the switching 
energy is not continuously decreasing, but increasing if structure sizes are 
reduced to the quantum-mechanical region. Based on this model we can derive 
a quantum-mechanical limit for nanoelectronic devices, which will be discussed 
later. 

The working principle of photodiodes, which are among others used as 
information receivers, can be explained by the interaction described above. A 
photodiode receives an electromagnetic wave and converts it into an electrical 
signal (Fig. 3.5). This principle is also used in molecular electronics, because 
molecules can usually be activated only by light and not by electrical signals, 
like transistors are. 

From this model we can derive the so-called quantum noise. Photons do 
not arrive uniformly at a receiver, but follow a statistical distribution. Thus 
the useful signal is overlaid by noise, which average power P for the given 
frequency / and bandwidth Af can be written as 

P = t hfAf . (3.25) 

The noise voltage drop at a resistor R follows from the theory of electric 
quadrupole: 
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Fig. 3.5. Generation of photons: (a) emission and (b) induced emission of light 



v 2 - 4 RP = 2 RhfAf. (3.26) 

The quantum noise is the stronger, the higher the frequency and the band- 
width. It occurs particularly in systems operated at high frequencies and plays 
an important role in high-performance integrated circuits. The exact quanti- 
tative derivation follows from thermodynamics by considering the power of a 
blackbody radiator and an harmonic oscillator. 

Another important interaction effect between photons and electrons is the 
opposite of charge-carrier generation: An electron can drop back from the 
conduction band to the valence band. Thereby it releases energy and emits a 
photon. This process is illustrated in Fig. 3.5 and occurs, for instance, in light- 
emitting diodes (LEDs). Another important effect is the so-called stimulated 
transition. If light radiates onto a current-carrying semiconductor with many 
electrons in its valence band, additional transitions are induced by equiphasal 
waves, which amplifies the incident light. This principle represents the basic 
operation of laser diodes. 

The efficient emission of light is only possible for semiconductor materials 
with direct band transitions, for example GaAs. The band transition of silicon 
is indirect, thus it is not possible to induce an efficient emission of light. An 
alternative approach is the use of bremsstrahlung: Electrons in the channel of 
MOS transistors are accelerated to high velocities and afterward slowed down 
abruptly by positively charged donor atoms in the drain region. The kinetic 
energy released by this sudden change of velocity is emitted as radiance. Be- 
cause the efficiency of this process is very low it is not implemented in technical 
applications, but it is used in the quality assessment of MOS transistors. 

3.1.6 Diffusion Processes 

First we will examine how the total energy of a system is distributed among 
its particles in the state of thermodynamic equilibrium. Therefore we consider 
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the energy distribution of electrons in a crystal. This energy is based on heat 
and its average is proportional to the temperature T: 



W = 




(3.27) 



where k — 1.38 x 10~ 23 J KT 1 is the Boltzmann constant. 

To determine the probability that an electron takes a specific energy value 
we first have to calculate the entropy of an electron gas. Then we choose 
the distribution function characterized by a maximum entropy value. This 
methodology leads to the so-called Fermi’Dirac distribution 



f F = 



1 



1 + exp 



W-W' F \ ' 
kT ) 



(3.28) 



This results in a rectangular function if we choose T = 0. The probability 
/f equals one for every energy below the Fermi energy Wf, he. all energy 
states are occupied. On the other hand fp vanishes for energy values above 
Wf, this means no states are occupied. If we consider high energy values the 
Fermi distribution can be replaced by the Boltzmann statistic from (3.30). 
Manipulating the terms (3.28) becomes 



i = exp 

J Fi 



Wi 

kT 



kT 



Cp • exp 



Wi 

kT 



(3.29) 



where we transformed the continuous representation to a discrete function. 

By manipulation of terms on the left side of this equation we find a dis- 
tribution function for other types of particles. The Boltzmann distribution 
function for a gas of atoms or molecules is given by 



1 

f Bi 



C B ■ exp 




(3.30) 



and by inverting the sign before the 1 in (3.29) we get the Bose’Einstein dis- 
tribution for a photon gas 



hi = C E ■ exp 

jBEi 



Wi 

kT 



(3.31) 



This equation’s modifications derive from the mode of particle distribution in 
the phase space. 

If the density or energy of particles inside a crystal is nonuniformly dis- 
tributed, a diffusion process occurs, which attempts to restore the state of 
equilibrium. These processes are important for nano- and microelectronics , 
for example in the case of charge transport, but they are not always desired, 
for instance when physical structures in real devices are changing over time. 

In the following we will develop a plausible approach to the diffusion pro- 
cess by starting from the simple model in Fig. 3.7. We assume that the density 
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Fig. 3.6. Fermi distribution for T = 0 K and T > OK. Wf is the so-called Fermi 
level, where the slope of the function equals kT. A simplified model of the distribu- 
tion process is presented on the right side of this diagram 



of electrically neutral particles is high on one side of the crystal and low on the 
other. Because of the thermal movement the total flow rate between a region 
with a high particle density and a region with low density is proportional to 
the gradient of the density distribution. Thus the following differential equa- 
tion describes the diffusion process: 

s °= d -{-£)’ i3!,2) 

where So is the particle flow, D the diffusion constant, and represents the 
density distribution gradient. To gain a better understanding we should take 
a closer look at the diffusion constant D. Therefore we start from a simple 
model of the diffusion process. We assume that the potential wells depicted 
in Fig. 3.7 are separated from each other by potential barriers of height Wa- 
The particles can only be transmitted to a neighboring well if they get past 
the barrier. The number of particles that carry the energy needed for this 
transition is given by the Boltzmann factor from (3.27). Thus we obtain the 
particle flows in both directions: 



Sm - 


( ^ ( w * 
= ca • n(x) • exp 1 - — 


^ and 


(3.33) 


$D2 - 


~ ca • n(x + a) • exp 


W A \ 
kT }'■ 


(3.34) 



where c is a constant factor, a equals the distance of the potential wells, a • A 
is the volume of a well, n represents the particle density, a-A-n(x) determines 
the number of particles in the observed region (A is the cross-sectional area 
and is set to one), and exp (— ~A) equals the Boltzmann factor. The resulting 
particle flow can be written as 
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Sd = Sm ~ $D2 = co • (■ n(x ) — n(x + a)) • exp 



W A 

"kT 



and by using differential notation we get 

„ 2 
o d — ~c a ■ — ■ exp 



Wa\ 

' kT )' 



(3.35) 



(3.36) 



So 




dx 




Fig. 3.7. Model for explaining the diffusion process and for the derivation of the 
diffusion constant 

If we compare this equation with the particle flow in (3.32), the diffusion 
constant can be written as 



D = ca 2 • exp 



kT 



(3.37) 



The diffusion constant, and the diffusion current decrease with higher poten- 
tial barriers and lower temperatures. 

If we consider the continuit}^ relation, which describes the particle-density 
variation with time in dependence from the local variation of the particle flow 
density, 



dn 8 Sd 

dt dx ’ 



(3.38) 



the important diffusion equation can finally be written as 



_ d 2 n 
dt dx 2 



(3.39) 
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This partial differential equation represents the basic law for all physical dif- 
fusion processes. Even thermal conduction can be explained by a diffusion 
process. The thermal current, i.e. the dissipated power, is proportional to the 
temperature gradient and follows from the diffusion equation (3.32) 

Pth = (3.40) 

where the parameter k , represents the specific thermal conductivity. The ther- 
mal current through a circuit, i.e. the power dissipated bv a chip, can be 
calculated with this relation. Device limits caused by this effect will be dis- 
cussed in Chap. 15. 



3.2 Basics of Information Theory 

3.2.1 Data and Bits 

The beginnings of information theory can be attributed to telecommunica- 
tions. At first, the goal of information theory was to efficiently transmit in- 
formation (messages), e.g. along a wire line. The information content of such 
messages is higher the more improbable their occurrence. It was also shown 
that all information can be coded in the binary digits 0 and 1. 

In the following we will deal with the question: What is information? If 
information should be processed in technical systems, it must be represented 
in some kind of physical structure. This mapping onto hardware considerably 
limits information-processing capabilities. That is why these considerations 
are essential for micro- and nanoelectronic systems [18]. 

To introduce the subject we will compare an information carrier of the 
past with one of the future: on the one hand the punched paper tape, and 
on the other hand single atoms deposited on a substrate surface. In the first 
case combinations of holes are punched in a strip of paper, and in the second 
example arrangement of atoms form letter symbols (Fig. 3.8). The symbols 
coded onto one of these information carriers can be read by one single person 
or can be viewed simultaneously by a group of individuals using some kind 
of projection. The combinations of holes or atoms form data, which can be 
processed or transmitted. The recipient of that data will gain information and 
is enabled to extend his stored data volume. 

Examining this simple case we can derive the four important characteristics 
of information by U. Baitinger: 

1. Information or data can be transmitted and processed without knowing 
its specific meaning. 

2. Some kind of carrier is needed for information storage: matter or energy. 
In the example above paper is used to store the data. Wether information 
can exist without a carrier is a question of philosophy of nature. 
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► hy 

Fig. 3.8. Schematic picture of a punched paper tape, where the y coordinate is 
scanned, over time t, and. a silicon substrate with single atoms applied to its surface 
by a scanning tunnel microscope (STM). Both samples represent information 



3. The information carrier can be exchanged without losing any information. 
For example, the data can be written from a punched paper tape to a 
semiconductor memory. 

4. The transmission and processing of information can be distorted. To trans- 
mit and process information without any loss is the challenge for informa- 
tion technology. 

According to the third rule w r e assume that the information carrier can be ne- 
glected and that abstract information can be treated in forms of mathematical 
formalism. 

In technical systems, data are processed and transmitted. For example, we 
can observe series of data on a line. Because we generally do not know the 
upcoming data word, its arrival is characterized by some means of surprise. 
This kind of surprise can also be described as information, which is determined 
quantitatively by the probability that a specific pattern X r occurs. The mea- 
sure of surprise is the greater, the more seldom a pattern occurs. Therefore 
the information of a pattern X r is inversely proportional to the probability 
of its occurrence P r . According to Shannon the definition of the information 
content I r of a pattern X r is written as 

I r = -KlnP r , (3.41) 

where K is a constant factor. The logarithmic function is chosen because it 
allows two information content measures to be easily added and because large 
values of I r are scaled down to a smaller range. 

In information theory, information content is measured in bits. This unit 
is defined by a so-called dual step, where the two patterns 0 and 1 arrive with 
the probability P r = 0.5. Thus the factor K is determined according to 

I = -Id 0.5 = 1 bit, (3.42) 



and the information content of a pattern X r can be written as 
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I r = -Id P r . (3.43) 

If the pattern X r is chosen from a set of m patterns with an identical occur- 
rence probability, then the average information content equals H , also called 
expectation value or ’following thermodynamics’ entropy. 

m 

H — - ^ Pi ■ Id Pi i = l...r...m. (3.44) 

i= 1 

In the following we are considering the special case of m different incidents 
with the same occurrence probability P r = The average information con- 
tent reaches a maximum value given, by 

H 0 = —Id P r = 1dm. (3.45) 



We can now examine patterns for many different applications and calculate 
their information, content. For example, a bus system of 6 signal lines, where 
one line is always in an active state, could be compared to a game of dice 
(Fig. 3.9). This signals could trigger switching elements in an array. These 
switching positions can also be treated as different patterns. 

Extending the problem description above we now assume that not only 
one hole in a paper strip or one switch in an array is allowed, but n holes or 
switches may be activated at once. Then the number of possibilities can be 
followed from the laws of combinatorial math 



Nl 

nK7V”-nj! 



'N' 

n 



(3.46) 



This approach can be extended to a two-dimensional field of switches 
(Fig. 3.9). For N fields and n closed switches the amount of combinations 
can be derived from the equation, above. If the number of closed switches 
varies between, n = 0 and n = IV, we obtain the number of possibilities by 



summation. 



m s . 



N 

E 

n — 0 



iN 



(3.47) 



This example can be transferred to integrated circuits, where the switches cor- 
respond to transistors. The interconnect wiring is irrelevant for the calculation 
of the information content, because it is static. The sequential logic system 
of a computer changes its state over time. The various states of this machine 
can be interpreted as data or different patterns. Thus the sequence of events 
inside a computer system can also be treated with the term of information. 
In the following we will examine the transmission and processing of data in 
more detail. 

Information theory models can also be found in other fields of physics, for 
instance in thermodynamics: If we consider an ideal gas and divide its volume 
into A r cells, where every cell can only accept one particle and we want to 
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Fig. 3.9. Different patterns of data: (a) punched paper tape with single holes ac- 
tivating switches, (b) hole combinations as information, (c) two-dimensional switch 
array, (d) three-dimensional distribution of particles in a segmented volume 



distribute n particles in this volume, then the information pattern becomes a 
three-dimensional structure (Fig. 3.9). The number of possible combinations 
of distributing n particles to N cells is given by (3.45). Because n and N 
are very large we can reduce the Nl term by approximation. First we apply 
Stirling’s equation 



In Nl « N • In N — N + I In (2nN). (3.48) 

Then we reduce this expression by neglecting the last part for N In N and 
by using n <C N 

N 

In m « n + n • In — , (3.49) 

n 

where m is the number of different patterns that can be found if n available 
particles are applied to an ideal gas of the volume V. This equation represents 
also a fundamental microelectronic relation. 

For the thermodynamics Boltzmann demonstrated that the entropy S = 
of an ideal gas can also be defined by the number of microelectronic states, 
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S = k • lnm, (3.50) 

where k = 1.38 x 10~ 23 J K -1 is the Boltzmann constant. The entropy of 
a closed system is constantly increasing, because disorder grow r s under the 
influence of heat. Based on this thermodynamic consideration the minimum 
energy that is needed to process 1 bit of information equals 

W = 2k Tin 2. (3.51) 

This relation derives from S = (3.45) and (3.50). It is noticeable that 

this relation is formally equal to (3.47). This prompted Shannon to give the 
name entropy also to the information content Hq . Shannon develops the infor- 
mation theory from mathematical abstractions, but it was Brillouin’s idea to 
transfer his considerations into physics. The ideas were further developed by 
Weizsaecker and applied to microelectronics and computer science by Meindl 
and Keyes. We have to take great care in the choice of the entropies sign: 
Because the entropy decreases by receiving information, information must be 
compared to negative entropy. This is why information is also called negen- 
tropy. 

3.2.2 Data Processing 

In the second half of the last century information theory focused on infor- 
mation transmission. This process is illustrated in Fig. 3.10 and consists of a 
transmitter, a transmission channel that is usually disturbed, and a receiver. 
The processing blocks represented in this diagram play an important role in 
microelectronics. The transmitter and also the receiver have to encode, trans- 
form, and decode the information flow. 

Transmitter and receiver also occur in information processing. In this case 
they are generally presented in one person, namely the computer user. The 




Fig. 3.10. Block diagram of (a) an information transmission system and (b) an 
information processing system. The channel, as well as the computer may be affected 
by noise 
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transmission channel is represented by the computer, which not only transmits 
information, but also transforms, i.e. changes it (Fig. 3.10). 

An important characteristic of every transmission is the information quan- 
tity M measured in bits 

M = —n • 1dm, (3.52) 



where t s is the time needed for one transmission step. The total quantity 
of information can be represented as an information cuboid illustrated in 
Fig. 3.11. The following measures are plotted on the axes of this shape: The 
complexity 1dm of the symbols, the number n of symbols, and the number — 
of steps. The information flow is defined as the amount of steps per unit of 
time 



dM 

dt 



(3.53) 



measured in ~~ 



or baud. 



n Id m 




Fig. 3 . 11 . Cuboid for the illustration of information quantity. The following mea- 
sures are assigned to the three axes: degree of parallelism (bandwidth), transmission 
duration, and complexity of the patterns 



The goal of a transmission is to forward this set as complete as possible. In 
contrast to a transmission, processing aims to selectively transform the infor- 
mation. The performance of information processing is therefore not measured 
in baud, but in MIPS (millions of instructions per second). In comparison 
to the transmission process a similar cuboid can be constructed. The axes for 
this case are: the number of pattern processed in the time i c , their complexity, 
and the number of patterns per processing step. 

M = —n • Id m. (3.54) 

tg 

Because a computer system changes the processing data, the information sets 
at the input and the output may have a different size. 

Most microprocessor systems known today are based on the so-called Tur- 
ing machine: Data is read from and written to a memory, and a processor 
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Fig. 3.12. The basic concept of information processing: A processor P processes 
data from the memory DM by a program stored in PM 



processes the information (Fig. 3.12). This concept gains its outstanding im- 
portance not without the thesis of Church, which says: The Turing machine is 
universal computational, i.e. all problems that can be solved by some method 
of computation can be solved by this machine. For nanoelectronics this state- 
ment is fundamental, because it means that every thinkable method of storing 
information and virtually every set of operations on this information is suffi- 
cient to solve any computational problem. 

In Chap. 1 we did see that some so-called hard problems could not be solved 
in this way due to a lack of time. This is why we have to apply concepts of 
parallel computation. Thereby we have to use new concepts that are often 
predetermined by specific technologies. A survey of different technological 
fields is given in Fig. 3.13. In the following chapters examples of these four 
elementary concepts will be presented. 

When considering information processing we can choose between digital 
and analog technology implementations. Thereby we have to ensure that no 
information loss occurs. The sampling theorem gives an answer to this fun- 
damental question (Fig. 3.14). It can be shown that the complete information 
content can be transmitted by discrete samples, if only the sampling rate is 
chosen adequately. Current, voltage, and power values are sampled in the case 



of electronics. The time difference At between these samples has to be shorter 
than the halfperiod of the highest frequency / or bandwidth Afo a signal 



consists of. 



At < 



1 

24ft/ 



(3.55) 




Fig. 3.13. Technological concepts for nanoelectronics and nanosystems 
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This relation can be made plausible by considering a Fourier transformation: 
The signal sine-wave oscillation with the highest frequency has still to be 
captured. 



U 




Fig. 3.14. Illustration of the sampling theorem. In order to capture the signal 
completely it is sufficient to scan the signal every At that corresponds to a sine- 
wave oscillation with the highest frequency 



Another problem is to determine the accuracy of an analog value repre- 
sented in digital form. But an analog value can also be represented only with a 
restricted degree of accuracy. If a digital signal of m discretization steps is ’by 
means of measuring technology’ indistinguishable from the analog signal, the 
number of discretization steps is high enough to represent the signal without 
information loss, and both approaches are equivalent solutions. 

The signals we can observe in nature are, as a rule, analog. For the transi- 
tion to digital representation we have to apply an analog to digital converter 
(ADC). Nowadays integrated circuits development trend toward digital imple- 
mentations, because digital switching elements are easier to construct and put 
together. In many cases the transmission quality can be significantly increased 
by using digital technology, because a high degree of precision can be reached 
by utilizing long data words. Nevertheless, every system on a chip generally 
needs an analog part, because at least an input circuitry has to perform some 
kind of analog to digital conversion. 



3.3 Summary 

Classic models of electrical engineering are generally sufficient for microelec- 
tronic applications. By introducing nanoelectronics, it is necessary to use 
quantum-mechanical concepts. Different physical values are quantized, which 
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leads to novel effects. Basic measures of information theory also exhibit some 
kind of quantization. These effects not only offer new ways of technological 
developments, they also erect limits to nanoelectronics. 




4 



Biology-Inspired Concepts 



The brain of any living being can be considered as a high-performance 
information-processing system. Besides their facilities, the underlying archi- 
tectures are of general interest for the development of nanoelectronics and 
they might be helpful to set a goal on nanoelectronics. 

Biological concepts can also be copied. In the following, three examples will 
be discussed that have been selected from many cases. The first example deals 
with the co-integration of biological neurons and MOS circuitries on the same 
substrate. The second example comprises the imitation of biological neurons 
via VLSI circuitries. An artificial neuronal network with local adaptation and 
spatial interconnected data streams is revealed by the third example. 



4.1 Biological Networks 

4.1.1 Biological Neurons 

Neurons consist of a nucleus and long dendrites reaching to neighboring cells. 
The signals travel via the dendrites to other neurons (Fig. 4.1a). 

The axon forms the output of the neuronal cell. As a rule, the axon ter- 
minates in. a synapse that establishes the connection, to other neurons. In this 
context, the so-called dendrites are connected to the synapses. They are in 
charge of the input signals of the neuron. A more precise description of the 
biological neuron’s setup is available in the relevant literature [19]. 

Detailed observations of biology confirm the spread and storage of infor- 
mation in neuronal networks. The information spread takes place along the 
axons, whereas information is stored within the neurons. Short-term storage 
is realized in terms of electrical signals, while long-term storage is based on 
biochemical modifications in the synapses. 

Within the synapse an axon meets a dendrite; the junction comprises 
a presynaptic membrane, the synapse gap and the post synaptic membrane 
(Fig. 4.1b). Roughly 1 000 vesicles are located at the end of each axon. They 
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Fig. 4.1. Schematic view of a biological neuron (a) and square section of a synapse 
(b) 



comprise about 10 000 molecules of a transmitter substance and are of funda- 
mental relevance for information storage. An incoming electrical signal on the 
axon pours the neurotransmitter molecules out of the vesicles into the synapse 
gap. Thus, the synapse is a simple biomolecular processor. According to Pen- 
rose et ah, the action of a synapse lies in the range of a few quantums of action 
h. The energy that is transformed in relation to the information processing is 
very small and lies in the order of 10 fJ. Receptors of the post synaptic mem- 
brane absorb the neurotransmitters and provoke a rising or falling voltage 
level in their accompanying neuronal cell. The coupling might be excitatoric 
or inhibitoric. Beyond a distinct threshold voltage level the neuron emits a 
signal via the axon [20]. 

Both learning and drugs modify the synaptic connections of a neuronal 
cell. According to a generally accepted theory, adaptation takes place in a 
form that the adapted events run automatically (Hebbian rule). Thus, these 
events are considered to be dominated like writing or swimming. The adap- 
tation process accounts for the effort of learning. On the contrary, drugs offer 
fantastic worlds and such imaginations appear without any effort, since the 
synapses are stimulated to a great extent without any control. This causes 
fantasies within the brain that make people become addicted. 

The dimensions of neurons are in the order of millimeters, however, ax- 
ons can measure several centimeters and even in some cases more than one 
meter. Biological neurons have relatively large dimensions in comparison to 
integrated CMOS circuits. However, the three-dimensional structure of neu- 
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Fig. 4.2. Photograph of biological neurons that have been preserved by Caja [19]. 
The samples show that there are quite various neurons 



ronal networks is very advantageous, in contrast to the presently limited two- 
dimensional integrated circuits. This limitation not only accounts for a low 
packing density, but also limits the wiring and architectural facilities. 

Figure 4.2 illustrates preserved neuronal cells of the cerebral cortex. Ac- 
cording to the applied preparation method of Caja, only 1% of neuronal cells 
appear in the clear arranged photograph. Thus, the complex interconnections 
and the high packing density can only be estimated. Furthermore, the pho- 
tograph reveals the existence of different biological neurons with individual 
complex interconnects. Up to now artificial neuronal networks do not account 
for these differences. 
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Fig. 4.3. From the biological neuron via the threshold gate to the matrix architec- 
ture 



Simple architectures for artificial neuronal networks derive from biology- 
based neuronal networks (Fig. 4.3). As described before, neurons can be 
treated in a first-order approximation as threshold gates. Two adjacent gate 
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levels are fully interconnected. This architecture is equivalent to a matrix con- 
figuration. Each column corresponds to a neuron and each dot represents the 
synaptic coupling. In terms of artificial neurons, these couplings are refered 
to as weights, which evaluate the input signals. 

Additionally, distinct biology-based networks comprise feedback loops be- 
tween different gate levels. In this context, research has been focusing for 
the last twenty years on artificial neuronal networks that originate from this 
structure. However, one has to keep in mind that similar results can be gath- 
ered from physics in terms of self-organization problems, e.g. the Isirig model. 
These architectures are of fundamental relevance for nanoelectronics, since 
data processing takes place in a parallel and local fashion. The borders be- 
tween memory and processor are vanishing. 



4.1.2 The Function of a Neuronal Cell 

The dynamic behavior of a single biology-based neuron is very interesting: 
Investigations of the axon’s response time surprisingly show that the relatively 
high output signal gets activated if the input signal crosses a distinct threshold 
value. The output signal itself does not depend on the slope of the input signal 
(Fig. 4.4). Within the experiment the internal potential of the axon gets fixed 
so that the current through, the membrane becomes measurable [19]. The 
question arises, how can this behavior be explained? 




Fig. 4.4. Impulse response of the axon. The axon only gets activated beyond a 
distinct threshold value 



First, we will derive the resting potential of the neuron membrane of about 
- 85 mV (Fig. 4.5). The cell membrane of a neuron exists as a double molecular 
layer. Its thickness measures roughly 5 nm, which results in an electric field of 
about 1.7 x 10 5 V/ cm. This value is about a factor of ten below the maximum 
electrical field of a CMOS transistor. The membrane is located in an aqueous 
solution of the body with relative permittivity of e w — 81, whereas the inside 
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of the membrane has a relative permittivity of e* ------ 2. Thus, the membrane 

comprises a potential barrier for charged molecules, like Na ions, of about 



2.3 eV = 100 kT. 




Fig. 4.5. Resting potential of the neuron membrane: Sodium ions and potassium 
ions are transported by an ion pump. Due to the density ratio, the neuron membrane 
shows a potential difference 



During the metabolism of living, neurons ions are pumped through the 
membrane. Thus, different ion densities on the inside and outside of the neuron 
can be traced: For instance, the density ratio of K ions between the inside and 
outside is 400 and 10 mM/1, whereas the density ratio of Na ions amounts to 
50 and 460 mM/1. Equivalent to a doped semiconductor the different densities 
result in a diffusion voltage: 



T T kT N VI 

\ D = — In-— 

q N e , 



( 4 . 1 ) 



N ex is the density of the extracellular aqueous solution, whereas iV ? ; n ac- 
counts on the density in the cytoplasm. Thus, the potential difference of the 
K ions amounts to Vk = — 92 mV and the potential difference of the Na 
ions is Vno, ~ +55 mV. Furthermore, Cl ions cause an additional potential 
difference. 

The capacitance C m of the equivalent circuit diagram in Fig. 4.5 represents 
the membrane that is connected via internal resistances to the diffusion voltage 
supplies. (Further iontypes are also involved, but are neglected here.) With 
respect to the internal resistances, the resting potential amounts to —85 mV. 
Ingestion delivers the energy to maintain this state. 

Next, the dependence of the internal resistances on the potential will be an- 
alyzed. Surprisingly, the conductance varies exponentially with the potential 
for low potential values, whereas the conductance saturates for high potential 
values (Fig. 4.6). The conductance depends on the number of active ion chan- 
nels. The number of active channels varies with respect to the iontvpe and 
the potential (Fig. 4.6). 
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Fig. 4.6. Cell membrane, approximately 5 nm thick, with an active and inactive 
ion channel 



The degree of alternation based on the Boltzmann law: Wt describes the 
transitional energy and Nq/Nc represents the ratio of opened and closed 
channels: 
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(4.2) 



The transitional energy depends on the number n of electrically charged 
particles that pass through the channel: 



W t = Wo - qnV m . (4.3) 

In this context, the normalized conductance is of fundamental relevance 
and based on the ratio between the number of opened channels No and the 
overall number of channels N: 
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Equation (4.4) describes the exponential dependence of the experimental 
data in Fig. 4.7. The ion channels control the current over the range of sev- 
eral orders of magnitudes, which is equivalent to electronic switching devices. 
The slope and the location of the conductance-voltage characteristics is of 
fundamental relevance for the correct operation of the neuronal cells. 

From the already derived basics we might answer the next question: Why 
does the neuron behave like a threshold gate (Fig. 4.4)? The contact potential 
of the quiescent state amounts to —85 mV. Electrical pulses below —40 mV 
cause the typical charging and discharging of the capacitance C m . An external 
excitation beyond —40 mV results in an output pulse of +20 mb, which is 
independent of the intensity of the excitation. The exceeding of the threshold 
value initiates a complex program in the cell membrane: First, a rapid Na + 
current drives the potential towards +20 mV. A delayed and slower K + 
current forces the potential back to its initial state. 

This model also explains the signal propagation along the axon (Fig. 4.8). 
Axon potentials that exceed —40 mV cause ion channels. Thus, Na ions move 
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Fig. 4.7. Conductance-voltage characteristics of a cell membrane with respect to 
the conductance of Na ions and K ions 



towards the axon’s inside and change the potential of the membrane. With 
a slide delay further ion channels develop and K-ions put the potential back 
to its initial state. It takes some time until the resting state is reached again. 
During this period the signal propagates along the axon. Due to the fact 
that the axon has to pass through this regeneration period the propagation 
direction is unambiguous. 
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Fig. 4.8. Schematic view of an axon and the propagation direction of its signal 



4.2 Biology-Inspired Concepts 

Consistently, engineers are inspired by biology. Such approaches like jet 
propulsion are appropriate, however, the helicopter’s propeller is very efficient 
but does not appear in nature. The following three examples demonstrate the 
variety of possibilities. 
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4.2.1 Biological Neuronal Cells on Silicon 

Initially, an engineer would probably consider the growth of neuronal cells 
on silicon and their combination with integrated circuits as an absurd and 
impossible dream. Nevertheless, experiments of P. Fromherz (MPI) show that 
neuronal cells from the brain continue to grow in a nutrient solution. Nor- 
mally, this growth process takes place in a glass bowl under uncontrolled 
circumstances. However, if the growth process takes place on top of a struc- 
tured silicon substrate (e.g. etched channel structures in a silicon oxide layer) 
the nerve fibers grow along these channels. Channel branches make the nerve 
fibers split and the end of a channel stops the growth process (Fig. 4.9). Ac- 
cording to this method, the shape of the neuronal network is determined by 
the silicon surface structure [21]. 




Fig. 4.9. Biological cells grow on top of a silicon substrate. The electric fields of 
the neurons interact with MOS structures 



Electrical pulses can propagate along the nerve fibers. Such pulses might 
originate from a synapse or might come from an external sensor. Other forms 
of excitations are: Direct signal feeding and influenced charge in the nerve 
fiber. Thus, the nerve fiber is interconnected with an electric circuit. The 
electric pulse follows the nerve fibers that are located in the silicon channels. 
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MOS structures are capable of picking up the signals of the biological net- 
work, which enables the interconnection with integrated circuits. For instance, 
the charge of a nerve fiber can influence a charge at the surface of the silicon, 
which creates a conducting channel between the source and drain area of a 
MOS structure (Fig. 4.9). 

Up to now, the packing density of this technology is very low and such 
concepts are not suitable for computer applications. They might become in- 
teresting for medical applications, since the whole human body is accessible 
via its neuronal network. Status information of the different organs might 
be gathered by tapping the neuronal network, which could save the money 
spent huge medical equipments. Today this is still a vision, but in the future 
it might help curb the costs of the public health system. Additionally, such 
concepts are interesting for artifical limbs. However, they are also interesting 
for nanoelectronics because this hybrid technology combines biological neu- 
ronal networks with electrical circuits. The combination is advantageous, since 
the complex behavior of the biological neuronal cell becomes integrated in a 
technical system. 



4.2.2 Modelling of Neuronal Cells by VLSI Circuits 

Nanoelectronics tries to copy biological concepts for information-processing 
systems. The following example reveals the reproduction of a neuronal cell 
with its exponential input-output characteristics on the basis of MOS transis- 
tors [22]. 

The above-described behavior of the neuronal cell is equivalent to the 
MOS circuit in Fig. 4.10. The input current Ie charges the capacitance C\. 
An adequate potential change lets the subsequent latch switch. The output 
voltage is almost independent of the input voltage, however, it depends on 
the supply voltage that corresponds to the membrane potential. Simultane- 
ous^, transistor T 2 is activated and discharges the capacitance C\. After a 
time, the potential has decreased sufficiently and the latch switches back to 
its initial state and the procedure can start over again. It follows from the 
equivalent circuit diagram: The more intensive the input signal, the faster the 
capacitance C\ gets charged and the more dense the pulse train at the output. 
From the viewpoint of microelectronics this part of the axon represents the 
functional integration of a multivibrator. In terms of biology, the switching 
devices consist of a cell membrane and an aqueous solution with Na and K 
ions. 

If the MOS transistors are operated in the subthreshold domain, not only 
the function but also the exact characteristics of the membrane can be imi- 
tated, since the transistors show in this domain (just like the biological mem- 
brane) an exponential characteristic. The simple MOS transistor model as- 
sumes beyond the threshold voltage a square current- volt age dependence. 
However, below the threshold voltage a subthreshold current is observable. 
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Fig. 4.10. Equivalent circuit diagram of a neuronal cell and the output signal of 
the threshold gate behavior 



Figure 4.11 illustrates the cross section of a p-MOS transistor and will be 
helpful to explain this current. 

s G D 




Fig. 4.11. Cross section of a p-channel MOS transistor 



Similar to the pn-junction, the charge densities of the p-MOS transistor 
channel depend on the surface potentials (ps and (pn of the source and drain 
areas: 
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ps = rue w , 
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(4.5) 



rii represents the charge density at the Fermi potential. The surface po- 
tentials follow from the applied source, drain, and gate voltages: 



(ps = <Po + (VgS ~~ Vs), (pD = <Po + (Vgs ~~ Vd). 



(4.6) 
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(j ) o is the built-in potential of the gate insulator that governs the threshold 
voltage Vt p - The factor m accounts for the fact that normally the applied volt- 
age does not completely reach the semiconductor surface. To simplify matters, 
this factor will be chosen as one in the following. The channel current is based 
on diffusion and the charge gradient can be approximated form (4.5) and the 
channel length L: 



5p = P d ~ Ps 
Sx ~ L 



(4.7) 



The diffusion current is proportional to the charge gradient and amounts 
to: 
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It follows from the above: 
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In this context, Iq comprises all stationary values. Choosing the source 
contact as reference potential (Vs = 0), the equation simplifies to: 

( ^YdS_\ 

I = I 0 e v t (1 - e v t J . (4.10) 

Equation. 4.10 describes the subthreshold voltage dependence of a MOS 
transistor. Its current changes by orders of magnitude, due to the exponential 
current-voltage characteristics (Fig. 4.12). 




Fig. 4.12. Schematic view of the subthreshold current- voltage characteristics (left 
side) and the strong inversion characteristics (right side) of a p-channel MOS tran- 
sistor 
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In terms of small current values, the subthreshold current is dominated by 
leakage currents, whereas high current values are limited by the resistance of 
the inversion layer (classical model). 

On the basis of the subthreshold current model, different complex VLSI 
systems like the electronic retina and the electronic cochlea have been already 
developed. C. Mead has been primarily involved in this area and has imple- 
mented biology-inspired concepts into analog MOS circuits [19]. Clearly, these 
strategies are also of fundamental relevance for nanoelectronics. 

4.2.3 Neuronal Networks with local Adaptation and Distributed 
Data Processing 

An important objective of nanoelectronics is a local data processing structure 
in order to avoid long wiring distances within the chip. This constraint might 
be met with biology-inspired neuronal networks. In contrast to many artificial 
neuronal networks, the rule of Hebb is a simple model for the description of 
neuronal cells and is based on local adaptation. The adaptation of an advan- 
tageous network would only be based on input and output signals (Fig. 4.13). 
Nevertheless, such a network can be arranged in a matrix and modular way. 



IN- 





OUT 



Fig. 4.13. Local adaptation does not depend on a global adaptation logic, but 
has distributed weight Control units. Within such a network, the neurons can be 
arranged in a regular manner 



The essential difference in comparison with conventional neurons (Fig. 4.14) 
is the local adaptation process that is based on a weight control unit. Accord- 
ing to the rule of Hebb, adaptation only takes place if the input and output 
signals coincide. Otherwise the information falls into oblivion, which means 
the weights are weakened. 

Figure 4.15 illustrates the timing of the adaptation process: First, a weight 
is assigned to the input signal. During the following steps, the weight of the 




4.2 Biology-Inspired Concepts 73 




Fig. 4.14. Schematic view of a conventional neuron as well as the schematic view 
of a neuron that is capable of local adaptation. 



Hebb synapse develops on the basis of the coincidence of the input and output 
signal. Further adaptation steps might increase the initially small weight. This 
results in a complete conditioning, namely the learning target, like cycling. 
Usually, the Hebb neuron is not based on a narrow 7 timing and filters have to 
be applied to enlarge the time signals. Thus, the probability for a temporal 
coincidence is higher. 

The schematic diagram in Fig. 4.16 reveals the function of a single neuron 
and comprises the above-described local adaptation options. This neuron type 
is also based on synapses with inputs that can have an excitatory or inhibitory 
impact. A fundamental feature of this neuron type is the veto-input that blocks 
the synapse input. Therefore, a command with higher priority might disable 
the neuron, e.g. for safety reasons. The so-called veto-synapse additionally 
limits the interconnection region of the neurons. 



STARTING 




CONDITIONING 



CS US UR 

A A 



s 


cs 


CR 


CONDITIONED 








STATE 








UR 



UR & CR 




UR A CR 



Ac E 



Fig. 4.15. Hebbian learning: Timing of the local adaptation process 
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Fig. 4.16. Schematic diagram of a single neuron with local adaptation options 



In general, a few neurons with only some inputs can handle quite complex 
procedures. For instance, only seven neurons already control a wheelchair. For 
comparison, a fly needs only 17 neurons for its motor activity. 
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Fig. 4.17. Increasing packing densities require new architectures that might be 
inspired by neuronal networks 



Nanoelectronics will comprise millions of neurons. Nevertheless, it will be 
problematic for such a huge system to automatically adapt an efficient archi- 
tecture. But certainly, this amount of neurons will be capable of very complex 
procedures like the human brain (Fig. 4 . 17 ). Their efficiency will always be 
higher than conventional networks, if they have to deal with indefinable or 
transient environments. These concepts are also interesting since data is not 
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stored in a single dot as described in Fig. 3.12, but dynamically intercon- 
nected in three-dimensional space. The structuring process might be based 
on self organizing principles. In this context, research is still at its very early 
stage. Certainly, the results will be very interesting for nanoelectronics. 



4.3 Summary 

For orientation purposes, it is very interesting to take a closer look at biolog- 
ical neuronal networks, since they show a high system performance. There- 
fore, biology-inspired electronics is an interesting alternative to classical ap- 
proaches, but has serious disadvantages like the lack of robustness and low 
reliability, etc. In the long term, these disadvantages will make it difficult to 
transform this area of research into any application. However, it will have a 
fundamental impact on nanoelectronics. 
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Biochemical and Quantum- mechanical 
Computers 



Information processing requires new concepts for the solution of complex prob- 
lems that will become even more complex in the future. All solutions for these 
problems envisioned today use parallel computation. However, the traditional 
microelectronics does not offer appropriate architectures nor efficient wiring 
technologies for parallel processing. Therefore, these problems are a challenge 
for nanoelectronics. The difficulty of finding appropriate solutions for these 
problems gives ample scope for new, unusual ideas, too. 



a) 




Fig. 5.1. Two unusual concepts for parallel processing: (a) quantum-mechanical 
computer, (b) DNA computer, processing unit (F), selector (S) 



Unusual concepts are biochemical computers, such as the DNA computer, 
and the quantum-mechanical computer (Fig. 5.1). The following presents both 
of these concepts although, their realizations are still far away. Both examples 
show that it is important to enlarge the scope beyond the nanoelectronics im- 
plemented in solid-state materials. In this challenging case we have to consider 
all possible ways that lead to an efficient parallel processing [23]. 
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5.1 DNA Computer 

5 . 1.1 Information Processing with Chemical Reactions 

If we regard the excellent solutions for information processing of nature it is 
obvious to copy them. An example of this is the DNA. computer that carries 
out the computations by chemical reactions [24] . The data are molecules inside 
a test tube. By adding extra substances the intended reactions are carried out 
whereas a robot can automatically perform all mechanical operations and 
measuring techniques (Fig. 5.2). In this concept the single molecules are the 
nanocomputers producing a huge number of solutions. The problem is to find 
the correct solution among the enormous number of results. 




Fig. 5.2. Principle of a DNA computer: The information process occurs in a test 
tube, a robot can handle the operations automatically 



Adleman [25] proposed the DNA computer and tested it in the lab. He 
used DNA molecules (deoxyribonucleic acid) - the basis of life - which is also 
known as double helix in the cells of living beings. DNA is a double-stranded 
molecule made by stringing four nucleotides together. The four nucleotide 
building blocks are A (adenine) , T (thymine) , G (guanine) , and C (cytosine) . 
By using this four-letter alphabet, DNA can store genetic information that is 
manipulated by living organisms. The mode of operation is close to that of 
silicon-based computers with binary encoding, but DNA stores the informa- 
tion by using four symbols instead of two. 



5.1 DNA Computer 
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Fig. 5.3. Part of a DNA molecule, just in the replication process 



A DNA molecule consists of two strings where each base on one string has 
a complementary base on the other string: A matches with T, and C matches 
with G, and vice versa. During the replication process only the corresponding 
molecules fit together, as Fig. 5.3 shows. The density that can be reached with 
DNA molecules is impressively shown by biology: The density of 10 21 hit /cm 3 
or 1 hit/nm 3 is much higher than those of a neural net in the brain. As a 
comparison the information of 10 21 bit corresponds to the information of about 
one trillion CDs. 

5.1.2 Nanomachines 

The DNA polymerase, which is an enzyme, can produce a second DNA strand 
that is complementary to an existent one. In this way the DNA strand gets 
a partner - we call this knitted - and the original molecule has doubled. The 
polymerase is a nanomachine consisting of only one molecule. It sticks on a 
given DNA strand, slips along the strand, reads one after the other base and 
inserts the complementary base including the part of the backbone to the new 
DNA strand. It operates as an inverter and writes a complementary copy of 
the strand (Fig. 5.3). 

That is the story from the biological point of view, the connection to 
the computer world is done via the Turing machine (Fig. 3.12). In a simple 
form a Turing machine consists of two storage tapes and a processing unit 
that moves along the tapes. One tape acts as an input tape, which is only 
readable, whereas the processor unit can read and write to the output tape. 
According to this model the DNA polymerase operates in a similar manner 
to the Turing machine. 

At a first glance the idea of an enzyme, here the polymerase, acting as a 
processing unit is obvious. However, it is not as simple as that - some processes 
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Fig. 5.4. Polymerase as a copying machine, it works as an inverter and produces a 
complementary strand to the original one 



have to be added to get the desired solution. The following indicates the most 
essentials steps: 

1. Combination of complementary strands: If a DNA strand meets its com- 
plement they are twisted to a double helix. 

2. The polymerase as a copying machine. The existence of a so-called primer 
is necessary for the correct operation of the copying machine. The primer is 
the starting point for the knitting. The polymerase extends each appearing 
primer in a preferential direction to a complementary strand. 

3. Ligases are enzymes that combine tw T o DNA strands to a longer one. 

4. Nucleases cut the DNA strands at defined positions. 

5. The gel electrophoresis sorts the DNA molecules with regard to their 
lengths. 

6. The DNA synthesis produces DNA molecules with defined sequences and 
strand lengths. A test tube with 10 18 molecules with a length of about 50 
base pairs cost 40$ in 2000. 

Adleman solved a traveling salesman problem (TSP) with the help of a 
DNA computer described above. His computer does not consist of any silicon, 
instead of silicon it uses some rows of test tubes, partly empt}^ while others 
are filled with DNA strands. The following 5-step algorithm was proposed by 
Adleman for solving the problem at the molecular level: 

1. Generate random paths through the graph. 

2. Keep only those paths that begin with the start point and end with the 
end point. 

3. If the graph has n nodes, keep only those paths that include all n nodes. 

4. Keep those paths that include the start- and end-point and pass all nodes 
only once. 

At least one solution is available. 

This algorithm has to be implemented with DNA according to the following 
steps: 

1. Synthesis of the desired strand. 

2. Separation of strands according to their length. 

3. Mix two test tubes into one to perform a union. 





5.1 DNA Computer 
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4. Extraction of those strands containing a given pattern. 

5. Melting/annealing by breaking/bonding two DNA strands with comple- 
mentary sequences. 

6. Amplification by using poljmierase to copy existing DNA strands. 

7. Cutting of DNA strands with restriction enzymes. 

8. Attach DNA strands with complementary endings with ligases. 

9. Detection by confirming presence/absence of DNA in a given test tube. 

Many combinations will be formed in test tubes. If billions of molecules 
and reactions are performed it is quite certain that the correct path will be 
generated and will be in the final mixture. 



5.1.3 Parallel Processing 

The last process steps have to find the correct solution among 10 11 results 
by sorting the DNA strands according to the defined criteria. Actually the 
reaction was successful: After about 5 days the DNA computer came up with 
the correct solution. A high-performance computer needs much more time to 
solve this problem, roughly millions of years. 

The selective reproduction is an important process step for an efficient 
computation on the DNA level as shown in Fig. 5.5. This step drastically in- 
creases the number of nanomachines operating in the proximity of the solution. 
The combination of DNA sequences, containing the way numbers and the com- 
plementary place-names, produces among others double-stranded molecules 
containing the correct start and destination address. Figure 5.5 depicts only 
the double strand for the Harniltionan path that is the solution. The arrow at 
the backbone denotes the read direction. In order to copy the correct double 
strand, primers are added to the solution. These primers contain the name of 
the starting point (GCAG) or the complement name of the end-point (GGCT). 
In the first reproduction step the double strand is separated into two single 
strands. An appropriate primer attaches only to the longer strand (Fig. 5.5b). 
The polymerase produces a complete double strand from the single strand 
(Fig. 5.5c). If this new double strand is separated in repeated steps, further 
primers can attach at both strands (Fig. 5.5d) so that the number of these 
DNA strands doubles each step and exponential growth is obtained. This pro- 
cedure only works if the strands start and end with the correct place names. 
This method drastically increases the number of solutions that can contain 
the correct solution. In addition, it also increases the probability of getting 
the correct solution. 

A biomolecular computer on the basis of DNA accomplishes a massive par- 
allel information processing. One small drop of a solution combines 10 14 DNA 
strands per second. These processing steps consume only a small amount of en- 
ergy, since the transportation of information consumes no energy (Chap. 11 ). 
10 19 operations consume 1J. Present-day supercomputers consume 1J for 
about 10 9 operations. The laws of thermodynamics set a maximum limit on 
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30 x 10 19 operations, so the DNA computer operates quite near at the physical 

limit. 

At the present time biocomputers based on similar concepts are under 
discussion. They may store the information in proteins and will use biology- 
inspired architectures. Such visions may become realistic in the future. Again 
the TSP is a good example of the use of those concepts. 



5.2 Quantum Computer 

5.2.1 Bit and Qubit 

In the recent past the quantum computer indicates new potentials for effi- 
cient parallel processing [26]. The following describes a vision of this concept 
especially with regard to nanoelectronics. This vision shows that a new con- 
cept can overcome the barriers of classical information processing. Quantum 
computing is also subject to the fundamental assumption that information is 
associated with physical quantities. Whereas the classical laws of physics pre- 
cisely define the information processing, quantum computing uses quantum 
objects as the information carrier. In our opinion these objects are subject to 
uncertainty. 





Fig. 5.6. Interaction of light waves with atoms, the structure inverts the input 
signal. However, this process describes no Qubit and is not based on the principle 
of quantum computing 



At first glance we think that quantum computing applies quanta as bits. 
Figure 5.6 show^s an inverter that uses an atom as a two-state system. If pho- 
tons pass the atom it changes its state, i.e. the electron moves to adjacent 
energy states by absorption or emission of a photon. Although this structure 
works on the quantum level its mode of operation is not typical for quantum 
computers. The above-mentioned is only an example for the classical w T ay of 
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information processing at the atomic level and an example of the classical 
nanoelectronics. An inverter as such may be useful for input/output stages 
of a quantum computer. A similar structure on the quantum level offers the 
quantum cellular arrays, QCA, which chapter 10 will present. 

In this context the so-called EPR experiment from Einstein, Podolski, and 
Rosen, w T hich was the idea of an experiment, is worth mentioning and was 
recently confirmed. This experiment describes a two-electron system that is 
composed of two electrons with opposite spin. In the following we separate 
them and force them to fly in opposite directions. If we observe one of the 
electrons far away from the other and define the direction of its spin by obser- 
vation of this electron, simultaneously the spin of the other electron changes 
to the opposite spin. This experimental result is inconsistent with our ex- 
perience since the information about the spin must be transmitted between 
the two far-away electrons without any delay that would be faster than the 
speed of light. This effect is relevant for quantum computers and especially 
for quantum cryptography. 
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Fig. 5.7. Various realizations of a Qubit, H. Weinfurter and A. Zeilinger 



The association between a bit and a two-state system is crucial for quan- 
tum computing. Examples of a two-state system can be different polarization 
states of a photon, the spin direction of electrons, the energy level of an atom 
or a quantum dot, or the modes of a beam splitter summarized in Fig. 5.7. 
If we denote both states of the system with | a) and \b) we can define the 
following identification: 



0 \a) , 1&\ b). (5.1) 

This symbolic notation is common in physics and denotes the assignment 
of a logical 0 and 1 to physical quantities. The fundamental new concept 
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of quantum mechanics compared to the classical laws of physics and data- 
processing technology is the coherent superposition of states. Therefore, the 
superposition of two states leads to the following result: 

|^) — ol\cl) + P\b) = a|0) + /? 1 1 ) . (5.2) 

From the viewpoint of physics this formula denotes that a bit is in the state 
| a) with the probability a 2 , corresponding to a logic 0, and additionally it is 
in the state |6) with the probability /? 2 , corresponding to a logic 1. A graphical 
illustration (Fig. 5.8) shows where the vector W is projected on the |0) and |1) 
axis. The value of the bit is undetermined before observation, which yields 
one of the two results with the given probabilities. Physicists call this effect 
coherence. 




Fig. 5.8. Representation of a Qubit as a vector 



However, if the described effect is only a mixture of various states no 
additional information content emerges. In our case this effect is a coherent 
superposition representing not only the states 0 and 1 but also intermediate 
states. Using this we get new operations of data processing that extend the 
possibilities of classical physics. A bit represented by quantum-mechanical 
laws is called a Qubit It is described by a vector in the so-called Hilbert 
space. 

5.2.2 Coherence and Entanglement 

The coherent superposition of several Qubits is much more interesting than 
a single Qubit. As an example a three- Qubit string is represented by a three 
two-state systems 



1010) - \a)\b)\c). (5.3) 

The important point is the correlation of the string to a unique state (one 
vector) in the Hilbert space. In a simple case the three Qubits may be in the 
following state 
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|000> + |111)/V2. (5.4) 

Equation (5.4) assigns all three Qubits either the value 0 or the value 1 
with a probability of 0.5. This result is fundamentally new in contrast to 
the classical physics where the bits are independent of each other. It is also 
fundamental, that the value of one of the three Qubits is determined from 
the beginning. The observation, of one of the three Qubits stops the uncer- 
tainty immediately and forces the other Qubits to specific values, which follow 
from (5.4). This characteristic of quantum mechanics is called entanglement 
The advantage of this effect is that the measurable state of all three Qubits 
appears simultaneously and not due to complex coupling between gates. 

The following numerical example demonstrates the potential of quantum 
computing. In the classical way we can store about 10 80 bits if all 10 80 atoms 
of the universe are used. By using Qubits we only need 266 atoms to store the 
same amount of information. 




Computer 



Fig. 5.9. Principle of a quantum computer, C. Bennett, IBM 



5.2.3 Quantum Parallelism 

A quantum computer has a number of Qubits as inputs and as outputs. The 
outputs are derived from the inputs on the basis of quantum-mechanical op- 
erations. Figure 5.9 shows the principle of such a simple quantum computer. 
In the simplest case the input state is the product of N Qubits. The unitary 
transformation produces a well-defined product state at the output. With help 
of the superposition any input state is the coherent superposition of many 
product states, where each state can have the probability of 1/N: 
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| Input) ------ {{Input i) + \Input 2 ) + . . .} /ViV. (5.5) 

The entanglement produces a superposition of all states that correspond to 
the respective input states. A quantum computer carries out a large number 
of computations simultaneously for a large number of various input states. 
This is called massive quantum, parallelism. 

Real-world data processing also requires nonlinear elements for logical in- 
terconnections. In this case the coherent modification of one Qubit is due 
to the state of another Qubit. For example, a series of three Qubits realizes 
an AND gate by choosing the corresponding combinations. Again the output 
shows a superposition of all possible results of this logical interconnection. The 
gate has computed the output signal for all possible inputs at once, which is a 
characteristic for the quantum parallelism. This concept is the basic principle 
of quantum computers. 

The quantum computer uses a fundamentally new principle, which is dif- 
ferent from the principle of the Turing machine, the basis of present computers 
[27]. The inherent parallel processing of quantum computing enables the com- 
putation of very complex problems, wdiere classical computers will probably 
fail completely. As an example the factorizing of large numbers into prime 
numbers is essential for new concepts in cryptography. Therefore the field 
of cryptography will become one of the first applications for quantum com- 
puting. Once quantum computing is well established many other important 
applications may come into existence. One of those applications could be an 
association memory that is a key element for sophisticated speech-recognition 
systems. 

Today the realization of a quantum computer is quite far away although 
physicists have realized systems with a few Qubits. The difficulty arises from 
the fact that quantum effects, which usually occur on the atomic level have 
to be controlled on the macroscopic level. 

The realization of Qubits is difficult since they have to form a closed nano- 
system without interference effects with the surroundings. In such a system 
an electron can occupy two states and can interact with a photon so the Qubit 
can be set. Nowadays ion. traps can set the vector of the Qubit with the fre- 
quency of the light. An appropriate solution for computers are quantum dots 
in solid states. However, this concept needs high-purity and perfect materials. 
Figure 10.22 shows schematically such a quantum-dot field, samples of it have 
been made and tested in research laboratories. 

A very promising concept comes from Mooij from the university of Delft: 
The Qubit consists of a superconducting ring with three Josephson elements. 
The magnetic flux quantum with its two states acts as a Qubit, as shown 
in Fig. 14.13. Advantages are the reasonable coupling of this element to the 
outside w r orld and the fact that this effect occurs at the macroscopic level. 
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In principle the quantum computer offers fundamentally new features for 
data processing on the basis of quantum mechanics without introducing new 
technologies except those developed for nanoelectronics. 



5.3 Summary 

The examples of the DNA computer and the quantum computer show that 
different approaches for parallel computing on the nanometer level exist and 
that they are not restricted to electronics. The difficult problem of efficient 
parallel processing demands extraordinarj^ concepts, so new innovative con- 
cepts will possibly presented in future. 

Quantum computing comes up with a fundamentally new concept that 
goes far beyond the Turing machine. This concept is a good candidate for 
future applications, such as associative systems or cryptography. 
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Parallel Architectures for Nanosystems 



For many years, research has focused on parallel data processing microsys- 
tems (Fig. 3.12) in order to enhance the computational power. Therefore, the 
following discourse discusses to what extent the future nanoelectronic devices 
are suitable for highly parallel systems. In this context, known architectural 
concepts have to be checked or modified. This check is necessary, since the 
new nanoelectronic devices show qualitatively new 7 properties in terms of their 
physical dimensions, power dissipation, gain, number of stable states, clock 
frequency, and above all integration level. The integration level is expected to 
increase from hundreds of million to hundreds of billion devices. 

From this point of view, a closer look will be taken at some concepts 
that will come into play before the DNA computer and quantum computer 
do. However, the stress lies on the importance of technology for the specific 
architectural system. This chapter does not claim to examine the fields of 
parallel data processing in general. 



6.1 Architectural Principles 

6.1.1 Mono- and Multiprocessor Systems 

Figure 6.1 depicts an overview of mono- and multiprocessor systems. Presently, 
the most common concept is based on the von Neumann principle and sepa- 
rates the processing unit from the memory (Fig. 6.1a). Usually, the memory 
is split into a program memory (PM) and a data memory (DM). According 
to the instructions of the program, the processing unit processes the data. 
The faster the instructions are executed, the higher will be the performance 
of the system. When the performance of a single processor is not sufficient, 
additional processors can be attached to the monoprocessor that, for instance, 
it evaluates complex algorithms in parallel (Fig. 6.1b). 

Today’s microprocessors have been successfully designed on this basis. 
They are advantageous due to their simple digital logic gates and their plain 
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a) 



c) 




Fig. 6 . 1 . Processor architectures (a) processor according to the von Neumann prin- 
ciple, (b) enhanced performance via cooperate processors, (c) processor array, e.g. 
systolic array, (d) processor nodes PN connected via weights W between the proces- 
sor nodes 



architecture. Their concepts are clear, but anyhow from, the point of view 
of microelectronics today’s microprocessors are quite ineffective, since only a 
small portion of the whole circuit is active simultaneously. The rest of the 
circuit is in an inactive state and contributes only to the power dissipation. 
For this reason, future concepts have to shut down these inactive circuit parts. 

Another approach tries to process data in a parallel manner with nanosys- 
tems. Obviously, the computational power of a system can be extended if 
many processors work on the same task. In this context, the harmonization 
of large processor areas turns out to be problematic. Up to now, only a few 
concepts attain an efficient load balancing. In the long term, a possible solu- 
tion might exist in a form of relatively loose coupled straightforward devices. 
In the following, some examples of such concepts will be given. Systolic arrays 
like in Fig. 6.1c are typical prototj^pes of today’s processor arrays. Connective 
systems (Fig. 6. Id) like artifieal neuronal networks form another processor 
array type. In this context, the next chapter deals with the relevant concepts. 

In the 1980s Seitz illustrated the different concepts in a single diagram for 
the first time (Fig. 6.2). The diagram shows the number of processor units as 
a function of the integration level. 

Figure 6.2 comprises two extremes: On the left-hand side, memory ap- 
plications depend only on a few devices per logic function, whereas on the 
right-hand side, single processor units require all devices of a single chip. In- 
between, all sorts of combinations are feasible, for instance the associative ma- 
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Fig. 6.2. Architectures of multiprocessor systems, originally from S. Seitz: pro- 
grammable logic array (PA), multiprocessor system (MP), single processor system 
(SP) for a technology level of 1 Gbit memories 



trix AM, the selforganizing map SOM, the cellular neuronal network CNN, 
the systolic array on processor and gate level, as well as dynamic biology- 
inspired networks. In the following, all these concepts will be discussed from 
the viewpoint of nanoelectronics. 



6.1.2 Some Considerations to Parallel Data Processing 

Parallel data processing is the most important issue in terms of performance 
improvements. Therefore, the efficiency of different parallel and timesharing 
concepts will be analyzed with respect to a single functional block F. 

The efficiency r] is inversely proportional to both the active chip area 
and the clock frequency. Parallel or timeshared functional blocks do not im- 
prove the efficiency, even if we assume that (de-) multiplexing can be neglected 
(Fig. 6.3). The ratio of throughput and chip area remains constant for both 
cases. 

A single operation F has to be split into suboperations F), which can be 
evaluated by parallel subfunctional blocks. At best the efficiency increases ac- 
cording to the number of parallel subfunctional blocks. However, it is unlikely 
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Fig. 6.3. Efficiency of different architectures in terms of their parallelism: The upper 
examples do not offer an enhanced efficiency, whereas the lower concepts improve the 
efficiency by a factor of N (degree of parallelism) if they comply with the assumption 
that the additional chip area and switching time can be neglected. According to A. 
Wieder, F represents the functionality, f stands for the frequency, and A is the chip 
area 



to sum up the subresults without any additional costs. In this context artif- 
ical neuronal networks offer a promising concept. The enhanced efficiency of 
pipeline architectures is based on the space and time parallelism structure. 
This concept assumes that the operations can be serialized into appropriate 
suboperations. Such structures can be found for instance, in systolic arrays 
and fuzzy systems. 

The simple configurations in Fig. 6.3 do not reveal the wealth of all possible 
parallel architectures, but offer a first glance at the additional costs of parallel 
data processing. Among other structures, typical processor array configura- 
tions are, for example: Ring, cylinder, sphere, and hypercube. In general, the 
decision depends on the software that has to be mapped on the architecture. 
Additionally, parallel data processing offers the opportunity of a partial fault 
tolerance on the hardware level. 



6.1.3 Influence of Delay Time 

Nanoelectronics follows the trend to high clock frequencies, since delay and 
switch times of the tiny devices are short. Clock rates of hundreds of GHz are 
already considered to be realistic. However, relevant factors tend to be missed 
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Fig. 6 . 4 . Novel computer architectures according to Tiwari [28]: The advantages 
of scaled parallel systems in terms of power consumption and cycle time. Only 
distributed computing yields a higher efficiency 



at first glance. For instance, the power supply of the clock can consume up to 
50% of the whole power consumption. Parasitic wiring capacitances have to be 
minimized in order to maintain the clock power within certain boundaries. In 
this connection, a further problem arises: Distributed parasitic capacitances 
limit the bandwidth of a wire and result in distorted signals. 




Fig. 6.5. The range of the clock signal is limited due to signal delays and distortions 



Figure 6.5 illustrates the maximum wiring length. Signal delays and dis- 
tortions can be tolerated within this limit. In this context, the estimation of 
Fig. 6.6 takes into account the wiring dimensions and the clock frequency. It 
turns out that only small areas can be reached from a single point. However, 




94 



6 Parallel Architectures for Nanosystems 



the packing density of nanodevices is very high so that many devices can still 
be accessed. 




Fig. 6.6. Maximum propagation distance as a function of the frequency with respect 
to the wire dimensions. M. Forshaw from UCL 



As is known, networks on the circuit level are sensitive to timing delays, 
but, nevertheless, the same is true for networks on the processor level. For 
instance, Fig. 6.7 illustrates a systolic processor array of 3 x 3 processors. 
Typically, the results do not appear at the same time, but emerge with a 
delay of two clock cycles. Thus, new operations can only be initiated at each 
second clock cycle and the processing speed gets limited by the time delay. 

Appropriate delay elements in the data paths omit this drawback. By 
this means, all results leave the processor array simultaneously and can be 
processed in a next step without any further delay. Therefore, delay times of 
the wiring are not an issue within such configurations. However, the result 
gets delayed. 
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Fig. 6.7. The impact of time delays can be cut down to the processing speed via 
delay elements 



6.1.4 Power Dissipation and Parallelism 

The extremes of parallel data processing will be discussed on the base of 
the SIMD architecture (Sect. 6.2.3). In this context, certain considerations of 
the MOS technology can also be applied to nanoelectronics. The degree of 
parallelism Pp can be expressed as: 



Pp = ( 6 . 1 ) 

N OP 

Nqp represents the number of operations of the single processor, whereas 
Npe stands for the number of parallel processors. If one processor takes the 
time to for the evaluation of one operation, the overall system cuts this time 
down to: 



tds = (6.2) 

Thus, the higher the degree of parallelism, the faster the evaluation of the 
task. This is of fundamental importance for all time-critical tasks. 

In terms of power consumption other considerations have to be taken into 
account. The time tds has not to be reduced, and the time to can be enlarged. 
The total power consumption comprises a dynamic and a static part: 

Plot — Pdyn + Pstat- (6.3) 

The dynamic part results mainly from the load capacitance Cl- 



Pdyn = C L V% D f c . 



(6.4) 
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In this context, Vdd refers to the power-supply voltage and f c stands for 
the clock frequency that is inversely proportional to to : 

fc = ( 6 . 5 ) 

to 

Cc is the accompanying proportionality factor. It follows from the above 
equations for the dynamic power consumption: 

P(iyn — ClPeVddPpNoP- — • ( 6 . 6 ) 

t'DS^P 

The dynamic power consumption does not depend on the degree of par- 
allelism if all parameters remain constant. However, it can be significantly 
reduced if the supply voltage is diminished. Though, this results in a slower 
gate switching which is no issue as long as the time Ids does not increase. 

The switching time of a CMOS gate can be approximated by: 



tdi = 



CliVdd 

ft (Vdd — Vr) a 



(6.7) 



Cli represents the load capacitance, /? stands for the technology constant 
and a (usually in the range of 1 to 2) accounts for the saturation effect of the 
carrier mobility. Assuming that a single operation has to pass through upe 
gates within one processor element, one can state with respect to (6.2): 



CliVdd 

PE ,8 (Vdd ~~ Vp) a 



< tdsPp- 



(6.8) 



If the application demands a specific processing time tds, Vdd can be 
expressed as a function of Pp (a ----- 2): 



Vdd = Vt+ 2PI + 



hV T b 2 

~Pp + 4P 2 ' 



( 6 . 9 ) 



In (6.9) b is equivalent to tipeCli / tdsP- Thus, a constant td s and a higher 
degree of parallelism, results in a reduced Vdd- However, Vdd can only be 
reduced as far as the transistors enter the subthreshold domain (Chap. 3), 
since the circuitry gets too slow in this domain. 

The static power consumption increases for a constant supply voltage with 
the degree of parallelism, since a larger number of devices cause more leakage 
current. 

These reflections prove the existence of an optimum degree of parallelism in 
terms of the overall power consumption P tot . The overall power consumption 
is dominated by its dynamic portion if the degree of parallelism is low because 
of the high supply voltage, whereas it is ruled by its static part if the degree 
of parallelism is high. The same is true for nanoelectronics, except that the 
static part might be neglected. 
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6.2 Architectures for Parallel Processing in Nanosystems 

The following integratable concepts reveal the feasibility and problems of 
nanosystems. 



6.2.1 Classic Systolic Arrays 

A known example is the systolic array. It is base on a regular structure 
(Fig. 6.8) and data gets pumped through this structure in a way comparable 
to blood circulation. Kung and Leiseron suggested systolic array algorithms 
that are based on modular design principles and exploit the spatial and tem- 
poral parallelism of data. The aim is to solve a given problem with p parallel 
processors, which should result in a speed up of p in comparison to a single 
processor. 

The literature already comprises many areas of systolic-array applica- 
tions. The most important ones are: Linear programming (vector and ma- 
trix operations), convolution systems, solving of partial differential equation, 
numeric (solving nonlinear differential equation, linear and nonlinear opti- 
mization, etc.) and graph algorithms. The name ‘systolic array' accounts for 
their rhythmic interaction between processor action and data transport. The 
area/time product serves as a benchmark for the modular systolic structures. 
The following example reveals the timing and communication challenges of a 
vector/ matrix multiplication array. 

The matrix W_ with its components W\ j • • • w 33 has to be multiplied by the 
input vector X (Fig. 6.8a). The output vector Y results from y 3 — wn • x\ + 
W12 • x 2 + Wi3 * x 3 , y 2 — w 2 1 • x\ + w 22 • x 2 + w 23 • x 3 and y 3 — w 3 1 • x\ + 
re 3 2 • x 2 + w w3 • x 3 . First, each component Wij of the matrix W gets stored 
in a single processor element. Then the input vector is pumped through the 
processor array During the first clock cycle, the input element x\ is multiplied 
by the element wn (Fig. 6.8b), summed up with the value on its right side 
and finally shifted to the left. In order to use the same design for all processor 
elements, zeros are inserted from the left side. During the second clock cycle 
wi 2 is multiplied by x 2l gets added to wn • aq, which conies from the left side 
and conclusively is shifted to the right side. At the same time, in the second 
row w 2 i is multiplied by aq and so on (Fig. 6.8c - f). After the third clock 
cycle the first component y± is available and after the fifth clock cycle the 
complete output vector Y is available. The next input vector Z has to follow 
directly after X so that each clock cycle delivers a new output vector. This 
means, for instance, that z\ and x 2 have to be inserted simultaneously. 

This example evaluates 27 multiplications of a vector - matrix multipli- 
cation. A single processor can only evaluate these multiplications in a serial 
fashion, whereas 27 processors would be needed for a parallel solution. The 
advantage of the systolic array is based on the fact that only three processors 
can process the weights W and the input value X in a very resourceful way. 
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Fig. 6.8. Example of the organization and function of a systolic array: matrix/vector 
multiplication as it is applied, for instance, in neuronal networks 



Taking a) into account, only six individual steps are needed, which leads to 
an area of efficient and fast implementation. 

6.2.2 Processors with Large Memories 

In the past, new processor and memory concepts led to higher performance: 
Within the same technology and at the same clock frequency, today’s RISCs 
(reduced instruction set computers) typically show a higher performance than 
today’s CISCs (complex instruction set computers). Typically, the perfor- 
mance is about a factor of 2 to 4 higher. The circuitry of a RISC is so simple 
that it can be integrated on a fraction of the chip area of a CISC. Thus, the 
production as well as the design of a RISC is less expensive and time consum- 
ing. This is why most of today’s processors are based on a RISC concept. 

The steady integration improvements have increased the ratio of the micro- 
processors and the memories speed. The drawbacks of the bottleneck between 
the processor and memory can be bypassed with a very large on-chip memory. 
Figure 6.9 shows the face plan of the IRAM (intelligent RAM) as an exam- 
ple of such a structure. The memory consumes roughly 80% of the total chip 
area whereas the remaining part is reserved for the RISC processor with an 
additional vector processor that serves for multimedia purposes. 

The maximum clock frequency can be roughly estimated as a function of 
the address space and the memory size. The bigger the address space the more 
time consuming the operation, e.g. an addition in the processor. The bigger 
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Fig. 6.9. IRAM (inteligent RAM), E. Kozyrakis of the UC Berkeley, [29] 



the memory capacitance the more extensive the memory itself and the greater 
the delay time. 

A further performance increase is attainable via multiport memories [30]. 
A single-port memory system consumes three clock cycles for one addition: 
The operand is latched in a register, then the second operand is read and 
the addition takes place and finally the result is stored. In terms of a three- 
port memory all these memory operations can take place simultaneously so 
that the addition operation, only consumes one clock cycle. Thus, the system 
speeds up by a factor of three. A six-port memory in connection with an 
additional processor results in a further speed up of about a factor of two. 
Obviously, this method enhances the computation performance, e.g. a thirty- 
port memory causes a speed up of about one order of magnitude. This concept 
of multiport memories might also be interesting for nanoelectronics. 
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Fig. 6.10. SIMD architecture of a fingerprint recognition system, S. Jung [31] 



6.2.3 Processor Array with SIMD and PIP Architecture 

Multimedia applications are the objective of the IRAM architecture, the 
same is true for SIMD (single instruction stream, multiple data stream com- 
puter) processor arrays [32], For instance, the fingerprint recognition system 
in Fig. 6.10 is based on a SIMD architecture. The same global instructions 
are fed into all processor elements that handle, for example, image-processing 
data. The global supply of all processor elements with the same instruction is 
a challenging issue for nanoelectronics, because of the extensive waring. Apart 
from the global waring and due to the regular structures and interconnects, the 
SIMD structure offers an ideal architecture for nanoelectronic-based system 
integration. 

The problematic waring loses its tension for the PIP (propagated instruc- 
tion processor) (Fig. 6.11). The PIP evolves from the SIMD concept. Within 
the PIP the operations are pumped through the network, while the PIP el- 
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ements evaluate their operations. According to this idea, the network can 
evaluate different algorithms simultaneously. 



Overall 

Control 

input 



PIP ARRAY 



H f- 


[- 


1 




1 


b 


A) 






1 


A 


An 

7 


A) 


A! 




H 1 


)i\ 
\ 1 





Control 

inputs 



Instruction 

Pipeline 

Registers 



H 



Single-Dit 
— | Processing 
Element 



PIP ELEMENT 



Fig. 6 . 11 . PIP (propagated instruction processor) architecture 



6.2.4 Reconfigurable Computer 

The RAW (reconfigurable architecture workstation) machine in Fig. 6.12 con- 
sists of a repetitive architecture. Each processing element comprises a local 
instruction unit and data memory. The complexity of a single processor ele- 
ment is equivalent to a medium-range microprocessor and roughly comprises 
two million transistors. The logic function of each processing element can be 
programmed via reconfigurable logic blocks, which is very important for na- 
noelectronics. Furthermore, programmable matrix switches permit point-to- 
point connections. The programmable connections of this concrete example 
consume about 30% of the whole chip area. 

Besides the valuable regular structure, the RAW machine also offers the 
possibility to map distinct hardware tasks into software, which generally is 
advantageous for implementation purposes. This concept is also capable of 
error-tolerant implementations, since reconfiguration and rewiring informa- 
tion can be stored in a program that might be based on benchmark data. 
A further substantial increase of the computational performance results from 
the localized reconfigurability. 

Figure 6.13 reveals the different levels of reconfigurable circuitries: Logic 
level, data-path level, arithmetic level, and control level. 

6.2.5 The Teramac Concept as a Prototype 

Probably, error-tolerant architectures have to be assumed in order to produce 
forthcoming systems comprising billions of devices in an economical fashion. 
Every computer that is based on nanoelectronics will include a considerable 
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Fig. 6.12. RAW (reconfigurable architecture workstation) machine based on a reg- 
ular and flexible structure 



number of defective devices, wires, and switches. Such a concept differs con- 
siderably from the ordinary complex computer concept. In this context, the 
Teramac concept should be mentioned as a prototype concept that has been 
built for research purposes. It is a user-configurable parallel computer machine 
that comprises many defects. However, it not only works, but also has a very 
high computational power. The concept is an inexpensive approach that is not 
based on an error-free hardware platform. First, the hardware localizes all of 
its defects and configures the parts that are in working order. Then, the pro- 
gram has to be compiled before it can be executed. Such concepts are based on 
an excessive wiring structure, which can be established by fat tree-structured 
networks (Fig. 6.14). On this basis, significant knowledge could already be 
gathered with regard to nanotechnology [33]. 

The Teramac comprises 10 6 logic units that are clocked at a relatively 
low frequency of 1 MHz . This is equivalent to 10 12 operations per second, i.e. 
tera operations per second. This feature is responsible for the Teramac’s name. 
The Teramac system has been realized in today’s microelectronic devices and 
comprises six levels. The lowest level performs the data processing by means of 
look-up tables (LUT). The upper levels connect together the LUTs and form 
more complex functional blocks. The wiring structures are designed in a highly 
redundant fashion. Programmable switches attain an enhanced flexibility. 

A first conclusion from the Teramac experiment claims to build an efficient 
computer as long as sufficient bandwidth is available to find functional com- 
ponents. A second conclusion states that the functional blocks of a computer 
need not be arranged in a regular structure, but have to have a complex wiring 
scheme with a kind of technical intelligence. Only by this means can defective 
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Fig. 6.13. Different methods for reconfigurable circuitries, J. Rabaey, UC Berkeley 
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Fig. 6.14. Architecture of the Teramac: It is based on a hierarchical structure with 
a high degree of redundancy (a). The right side (b and c) reveals the implementation 
principle of logic functions 



elements be eliminated in an effective way. The third conclusion concerns the 
switching devices: The most important components of future nanoelectronic 
circuitries will be the address bus and the matrix switches, since they rule 
the hardware configuration (Fig. 6.15). The new paradigm claims that diffi- 
cult hardware tasks should be scheduled as software tasks. This is equivalent 
to the development of the microprocessor, namely to put complex application 
programs into software. 

The conclusions of the Teramac experiment offer complete new concepts for 
nanoelectronic research: It is a top-down concept and not a technology driven 
bottom-up concept. Key components are wiring, switches, and memories as 
they guarantee efficient nanoelectronic structures. 



6.3 Summary 



The steady improvements of microelectronics result in a continuous increase of 
computational power in terms of the classic processors. On the basis of already 
gathered experiences one can claim that the simple von Neumann processor 




6.3 Summary 105 




Fig. 6.15. Configuration of the interconnection points (a and b) and the idea of fat 
tree-structured networks (c and d) 



structure will retain its importance. New system architectures for parallel data 
processing are under intensive investigations. This is where microelectronics 
comes in, because only by means of an appropriate hardware implementation 
can the advantages of, e.g. multiprocessor structures and extensive wiring 
structures, be exploited. The advantages of the hardware will increase further 
when it comes to nanoelectronics. 
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Softcomputing and Nanoelectronics 



Artificial neural networks (ANN), fuzzy systems FS), and evolutionary algo- 
rithms (EA) or genetic algorithms (GA) provide interesting approaches for na- 
noelectronic architectures. These three concepts are summarized by the term 
computational intelligence (Cl), or softcomputing. They will extend the capa- 
bilities of conventional computer systems, because softcomputing facilitates a 
better data acquisition from our everyday environment [34] . Cl methodologies 
will enable nanoelectronic systems with new and very special characteristics 
[35]. 

By applying the concept of fuzzy logic to an information-processing system 
the fuzzy controller can be realized. This controller is based on linguistic rules, 
so that human experts can insert knowledge into the system. The behavior of 




Fig. 7 . 1 . Schematic diagram of an autonomous system based on computational 
intelligence methods 
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Fig. 7.2. Examples of membership functions 



neural networks is adapted by input data supplied by the environment. These 
networks learn by adjusting the strength of connections (weights) between 
the neurons. Although the entire system turns out to be very complex, it can 
be optimized by applying genetic algorithms. By using these three methods, 
a completely autonomous system can be realized (Fig. 7.1). Autonomy is an 
important feature for nanoelectronic components, because programming these 
systems from the outside will be a serious problem [36]. 



7.1 Methods of Softcomputing 

7 . 1.1 Fuzzy Systems 



Fuzzy systems offer two important advantages for very complex nanoelec- 
tronic systems: First, the structuring and programming of these systems can 
be carried out by using linguistic data. Secondly, the processing of fuzzy val- 
ues is very robust to variations of absolute quantities, which is an important 
feature for nanoelectronic switching elements. The potentialities of the second 
advantage are barely investigated as yet. 

The concept of fuzzy or unsharp values is very familiar to nanoelectronic 
considerations. Heisenberg’s uncertainty principle and the probabilistic mean- 
ing of Schrodinger’s equation are fundamental facts. By using these equations 
we can calculate the probability of finding a particle in a specific spatial region, 
as described in Chap. 3. 

But the fuzzy theories meaning of unsharpness is different from the con- 
sideration stated above. In this case we have to distinguish between randomly 
distributed and linguistic unsharpness. Random unsharpness is modeled by 
the probability theory. Linguistic unsharpness describes the uncertainty of a 
textual characteristic, for example fast, big, small, etc. The probability the- 
ory requires that all possible events of an experiment are known and that the 
sum of independent probabilities equals one. This does not apply to linguistic 
unsharpness, which is an important difference. 

Today fuzzy logic is used as a common expression for the theory of fuzzy 
sets and the fuzzy logic itself. The theory of fuzzy sets can be regarded as an 
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Fig. 7.3. Fuzzy operations: Minimum and maximum operation 



extension of the classical set theory, which only classifies elements in matching 
and nonmatching types. This concept of matching is extended for the theory 
of fuzzy sets. The factor ji indicates the degree to which an element matches 
a particular set. Thereby 0 and 1 correspond to the classical case of matching 
and nonmatching. For the matching degrees between 0 and 1 the element only 
belongs partially to a specific set. Figure 7.2 presents fuzzy sets as mathemat- 
ical descriptions of linguistic relativization of quantities, for example a “high” 
pressure or a length of “about 1 m”, “a little shorter than 1 m”, or “a little 
longer than 1 m”. The depicted triangular functions are arbitrarily chosen. 
Normal distributions are alternatively used to define fuzzy sets, because they 
can be completely described by two parameters. These membership functions 
contain, the expert knowledge of a fuzzy system. They must be varied in such 
a way that the given task is solved as well as possible. 

The set concept extension to fuzzy sets has the advantage of coming closer 
to the human understanding than the classical set theory. It enables us to 
model complex and almost incomprehensible systems. Corresponding to the 
examples in Fig. 7.2 we can define membership functions for any object or state 
that can be described or characterized by a human verbalism. This process of 
assigning a membership function to an object or state is called fuzzyfication. 

Comparable to conventional digital logic we also have to define a set of 
basic operations for fuzzy sets. First, these operations are intersection (mini- 
mum) and conjunction (maximum), which are illustrated in Fig. 7.3. 

Computers are not only suitable as data-processing machines, but also as 
knowledge-based systems. They can process data as well as knowledge in the 
form of rules. To make this practicable we have to implement an intensive 
networked set of switching elements, which results in high demands on the 
system architecture. One realization of such a system is the fuzzy controller, 
which is briefly explained in the following: 

By introducing the fuzzy technique we can feed linguistic data into a com- 
puter. For this purpose, the fuzzy controller was developed. Today this con- 
troller is mainly employed in control engineering, which led to considerable 
improvements in process-control applications. What kind of significant effects 
does this concept have on nanoelectronics? The field of application is mainly 




110 



7 Soft computing and Nanoelectronics 



given by autonomous systems. Because the number of connecting terminals 
for large-scale systems is always restricted, we have to apply concepts that 
can structure and program themselves. 

At first we will show how rules that occur in knowledge-based systems 
can be explicitly represented by fuzzj^ relations. A fundamental characteristic 
of fuzzy logic is the linguistic, often unsharp presentation of quantities. This 
unsharpness should not be interpreted as inaccuracy, because fuzzy logic is 
absolutely capable of producing exact results. Fuzzy logic makes high demands 
on microelectronics, but also presents promising opportunities. 

For complex tasks with many input variables several rules are needed, 
whereby a premise may consist of many partial premises. For this purpose 
Maindani developed the following evaluation strategy which is illustrated in 
Fig. 7.4: 

® If a premise consists of several partial premises, the truth value is equiva- 
lent to the lowest truth value of the partial premises. 

® The conclusion is limited to the rules truth value. 

® If several rules produce a truth value > 0, then the rule conclusions will 
be superposed. 

For the control of real processes it is often necessary to convert the fuzzy 
set into a discrete correcting variable, which is called defuzzyfication. There- 
fore several methods exist, which were mainly proposed intuitively. The most 
common technique is the center-of-gravity method, which is also illustrated in 
Fig. 7.4. 

The rules shown in Fig. 7.4 apply to the simple example of charging a 
NiCd battery. A typical effect of this application is the aging of the battery 
cell, which can not be described mathematically but can be covered with lin- 
guistic rules of fuzzj^ logic. The input variables, which correspond to measured 
values, are presented as so-called singletons. The appropriate output value is 
calculated from the intersection sets by a minimum operation, which is limited 
by the truth value of the respective rule. The results of the individual rules are 
combined by a maximum operation. By using the center-of-gravity method, 
a discrete value is derived from the resulting fuzzy set. In this context the 
fuzzy controller is not described in too much detail. We will only treat the 
fundamental aspects that are relevant to microelectronic applications. 

In this simple case the output signal’s dependence on the two input vari- 
ables can be illustrated by the three-dimensional plot in Fig. 7.5. The charging 
plot produced by the fuzzy controller is single- valued and absolutely not un- 
sharp. The fuzzy controller is a nonlinear analog operational unit, which can 
be compared to an approximation system. It produces a control signal to 
near-optimally charge a battery by applying the linguistic set of rules. If a 
complete mathematical model of the battery existed, the exact charging con- 
trol characteristic could be calculated. But although this is not the case, the 
problem can still be solved approximately by using a fuzzy controller with 
specific linguistic rules. 
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Fig. 7.4. Illustration of fuzzy-system rules for the example of charging an NiCd 
battery 



According to microelectronics two aspects are very important: First, the 
control characteristic has to be described by a small set of rules determined 
by a simulator. It can be shown in practice that often less than five rules are 
sufficient. Moreover, the type of membership function should be determined 
as simply as possible. Both steps are suitable to reduce the circuit imple- 
mentation effort drastically. Secondly, only the principal characteristic of the 
charge control is needed. That is why relatively inaccurate fuzzy-gate func- 
tions, which can be realized with less effort, are sufficient to build a controller. 
This strategy becomes especially important, if we want to develop more com- 
plex systems, for example, a complete array of image processing fuzzy elements 
built from sub ~fim transistors. 
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Fig. 7.5. The charge current I according to the temperature T, and the voltage 
change dV. The nonlinear, analog output function can be constructed by the given 
fuzzy rules 



A fundamental characteristic of fuzzy rules is a probably accurate result, 
although the computation is based on uncertain data. This feature can be 
of great significance for nanoelectronic applications. Because of the very fine 
structured devices and because only a few particles are involved in data pro- 
cessing, significant parameter variations may occur. These variations also lead 
to fluctuations in output signals. Therefore techniques have to be developed 
that take these circumstances into account. 

7.1.2 Evolutionary Algorithms 

The transfer of evolution processes to technical applications is an interesting 
task in information theory. Comparable to artificial neural networks and fuzzy 
systems, genetic algorithms are inspired by biological concepts. In analogy to 
an organism adapting to a changing environment by evolution, a technical 
system could adjust itself to changing boundary conditions by artificial evo- 
lution (Fig. 7.7). For this purpose the system is represented by a data set that 
is exposed to mutation and crossover operations. The resulting data set is 
maintained if it is improved compared to the original set, otherwise it will be 
ignored (Fig. 7.6). 

This decision demands a target function: The survivability of an organism 
is the main goal of biological processes. However, the priority task of technical 
systems is the optimization of special characteristics, for example, their power 
or hardware effort. The evolutionary methodology is only useful for very com- 
plex systems that cannot be mathematically modeled, which definitely holds 
true for nanoelectronic systems. 
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Fig. 7.6. The basic principle of an evolutionary algorithm 





Fig. 7.7. The selection mechanism of an evolutionary algorithm. The picture illus- 
trates two iterations for the minimization of a two-dimensional potential function 



The method of genetic processes in applications did not arise until fast 
computer systems were available, which enabled the computation of artificial 
evolution. A major goal of nanoelectronics will be the implementation of evo- 
lutionary systems in nanoelectronics itself. Therefore evolutionary algorithms 
have to be parallelized and efficiently implemented in electronics. Thereby the 
problem of wiring will play an important role, because the data sets described 
above have to be read, altered, and transmitted by the system. 

7.1.3 Connectionistic Systems 

Connectionistic systems represent a very promising approach to processor 
arrays, which could solve some common problems of parallel processing. 
Neighboring processors of such arrays are directly connected. The connection 
strengths (weights) determine the behavior of the net. The weights depicted 
in Fig 7.8 represent the program stored inside the net, comparable to a neuron 
illustrated in Fig. 7.9. On the one hand, the connections to the weights have to 
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Fig. 7.8. A net of parallel processors, where the weights determine the connections 
between the processing elements. Two time-varying states are depicted 



be drawn, on the other hand the weights have to be loaded. The development 
and adaption of weights in biological systems is based on heredity (trans- 
mission) and learning. For technical systems a similar approach is applied: 
First, we try to calculate appropriate weights by simulation. Then, the com- 
puted values may be implemented. The finetuning of the weights is carried out 
subsequently by learning, which can be a time-consuming process. From the 
microelectronic point of view such arrays exhibit interesting features, namely 
association, self-organization, and fault tolerance. 

A special case of connectionistic systems are neural networks, which are 
derived from the biological nets of nerves, where all neurons may be connected 
amongst each other. In the following we will present four examples of neural 
networks. For each of them an implementation idea for VLSI systems exists. 
From a VLSI point of view they have the advantage of modularity and fault 
tolerance. 

Artificial neural networks are an important challenge for nanoelectronic 
implementations. First, large networks with many parallel processing neurons 
are needed for relevant applications. Secondly, neural networks have charac- 
teristics that are also important to nanoelectronics and VLSI CMOS circuits, 
for example fault tolerance and the ability of adaption. The basic principle of 
a simple neural network is illustrated in Fig. 7.9, which shows the schematic 
structure of a single neuron. The neuron weights that are determined by the 
modified Hebb rule (described in Sect. 4.1.1) are stored in the depicted cross- 
ing points. The products of weights and input signals are added and trans- 
mitted over an additional signal line and the resulting sum is compared to the 
threshold value T/ x . If the signal reaches the threshold value the output is set 
to one. 

Figure 7.10 illustrates a two-input neuron with its characteristic input - 
output surface. The defined weights divide this space into two sections. If 
we extend the examined network by a second level of neurons four different 
sections will arise in the output space. This network configuration can solve the 
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Fig. 7.9. An example of the structure of an artificial neuron 



XOR problem, which was considered to be impossible by using only neurons 
until the 1960s. If we further extend the network by a third processing level, 
every possible segmentation can be constructed and the network can be used 
as a universal mapping system. The main implementation problem of these 
networks is the task of learning. As long as no solution to this problem was 
known, neural networks were considered to be useless. Below we will present 
some learning methods that became established. 

The artificial neuron presented in Fig. 7.9 and 7.10 is very similar to a 
threshold gate. The weights determining the thresholds that are adapted by 
learning are the main difference between these two approaches. Concerning 
threshold gates, these weights have to be predetermined in an optimal form. 

A special case of neural networks are cellular networks, which are char- 
acterized by the strictly local type of neuron connections, i.e. only direct 
neighbors are linked. Cellular networks try to globally solve a given task only 
by local information processing. The local connections may consist of simple 
weights or so-called fuzzy links. 

7.1.4 Computational Intelligence Systems 

Figure 7.11 shows some applications of autonomous softcomputing systems 
[37]. The number of synapses, which equals the number of weights, indicates 
the level of integration. A measure of the networks performance is given by 
the number of connections updated per second (CUPS). In practice, today’s 
applications are implemented with VLSI devices. For the realization of com- 
plex problems, e.g. image processing for autonomous control of vehicles, more 
sophisticated technologies such as fim - CMOS are needed. 
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The hardware requirements for real intelligent and autonomous systems 
are several orders of magnitude higher. Many fast conventional processors 
cannot solve these tasks sufficiently, because the processing could not be done 
in real time and because the volume of such processor arrays is too large. 
Only networked nanoelectronic systems with a high degree of parallelism offer 
a promising way to implement such complex, applications. 



7.2 Characteristics of Neural Networks in 
Nanoelectronics 

In order to handle high complexity problems new features have to be imple- 
mented in nanoelectronic systems. Such features are local processing, fault 
tolerance, distributed storage, and self-organization. 



7.2.1 Local Processing 

Cellular neural networks (CNN) are an example of local processing. These 
simple neural networks were developed by Chua from cellular automata. The 
processors of such networks are only locally connected, whereby the weights 
are often considered as binary values (Fig. 7.12). The basic algorithm is prin- 
cipally given by: 
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(7.1) 



V(i,j, n) is the output value of the cell C(i,j) for the time step n. E(i,j) equals 
the input value of the cell C(i,j). A and B are operators, / is a nonlinear 
function, and I is a bias value. The principal function of a cell and its network 
integration is shown in Fig. 7.12. 

An application is a photodiode-matrix that delivers the input signals of 
a cellular network. To reduce the noise of a picture that was taken by the 
matrix, we have to appfy the following operators: 
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The values of a cell itself and the values of the neighboring cells are assessed by 
the operator A. The input signals, for example, the light signals scanned bv the 
photodiodes, are adapted by the operator B . The results of this operation are 
added and increased by the bias /. Then this sum is assessed by the nonlinear 
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this noise-reduction algorithm., other templates are known that are commonly 
applied to image processing tasks, e.g. edge detection, structural correction, 
etc. 

Because of their structural simplicity and because it is very difficult to com- 
pletely connect all processors of a nanometer-scale device among each other, 
cellular networks are a convenient concept for nanoelectronic or molecular- 
electronic system implementations. On the other hand, the information- 
processing capabilities of cellular networks are restricted, which means that 
this concept is limited in its practical importance. 



7.2.2 Distributed and Fault- Tolerant Storage 

An associative matrix is a very simple neural network. It is a memorj^ device 
that is capable of storing associative data words. When applying the first 
data word as an input, the best matching data word is read from the memory 
matrix and transmitted to the output. Associative and distributed storage 
are interesting concepts from the nanoelectronic viewpoint. Both ideas will be 
explained by the example of a simple and small associative memory. 

Today the data stored inside conventional microelectronic memory circuits 
can only be recalled by its address or by its content. Based upon artificial neu- 
ral networks a distributed storage, where data is saved spread over a matrix, 
can be carried out. 

An associative matrix is based on the same principle. The weights that 
are stored on the matrix points can only attain binary values, i.e. zero or 
one. In the learning phase we have to apply the associative data pairs to 
the matrix input and output. The learning algorithm will set the weight of a 
matrix intersection point to one, if both the input and the output according to 
this intersection are set to one. The matrix values are subsequently adapted 
by supervised learning. Figure 7.13 shows for a simple example, how data 
patterns are saved on the matrix. In order to read data from the matrix an 
input pattern is at first multiplied by the stored weights. Then the products of 
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1st association 2nd association 3rd association 

Fig. 7.13. Simplified example of an associative matrix with distributed storage. 
The matrix consists of 5 neurons with 5 inputs each. The weight can only attain the 
values 0 and 1 



each matrix column are added. If the resulting sum is higher than a threshold 
value, the output is set to one, otherwise to zero. In this way the output data 
word for a given input pattern can be reliable associated. 

This principal functionality can only be maintained if w r e assume a sparsely 
occupied memory matrix. If all matrix positions contain a one, the association 
algorithm would fail and the memory matrix would be useless. In contrast to 
other common microelectronic memory devices the associative data is stored 
distributed all over the matrix. Palm [38] derived equations for the calcula- 
tion of associative memory capacities and showed that such devices may hold 
96% of the capacity of classical devices. He also deduced an expression for 
the unavoidable rate of data errors that may occur for high degrees of ma- 
trix occupation. This concept is only relevant for Gbit associative memory 
matrices. 

If some of the matrix memory cells fail, the association process may still 
work as expected, i.e. the network is fault tolerant to some degree (Fig. 7.14). 
To achieve a robust fault tolerance the thresholds have to be slightly decreased, 
which in turn also leads to a decreased noise immunity. This also means that an 
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Fig. 7.14. Association and hardware errors in the matrix. The shaded cells represent 
defect cells in comparison to Fig. 7.13. If the threshold is decreased to 2 the results 
are still correct 



upcoming system failure can be detected earty (graceful degradation). Fault- 
tolerance properties will gain further importance in future electronic systems, 
because the complete functionality of those system components cannot be 
guaranteed. 

7.2.3 Self-Organization 

An additional characteristic property of neural networks is the ability of self- 
organization. A neural network can automatically adjust its weights accord- 
ing to the presented input data, so that a suitable mapping to the network is 
achieved. This property is not only interesting for information-processing pur- 
poses. It is also an important aspect for nanoelectronics, where homogeneous 
arrays of many small elements have to be structured for the desired behavior 
of the whole system. 

In 1978 a model that could explain the self-organization processes in the 
human brain was searched for, and Kohonen invented the biology-inspired al- 
gorithm of self-organizing maps (SOM) [39]. The structure of such a map 
is given by an array of processing elements (neurons) that are connected 
among each other. The n-dimensional input vectors, which are subsequently 
applied to the input of this network, are automatically mapped to their specific 
best-matching location on the two-dimensional map plane. That is how an n- 
dimensional input space is mapped by self-organization on a two-dimensional 
network plane. 

In the self-organizing maps learning phase a restricted set of correlated- 
data vectors is applied in a statistically distributed sequence to the processing 
elements of the map. For every given input pattern the map is searched for 
the so-called best matching unit (BMU). This unit is the processing element, 
which carries the reference vector that is most similar to the given input vector. 
After estimating the position of the BMU an adaption takes place, which 
affects all processing elements in a given range to the best-match position. 
This range, also called the neighborhood radius, is subsequently decreased 
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Fig. 7.15. Basic principle of a self-organizing map 
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Fig. 7.16. Mapping a three-dimensional data space 




onto a two-dimensional map 



during the learning process. That is why the map is roughly structured in the 
beginning of the learning phase, and fine tuned at the last learning steps. The 
disadvantage of this very effective parallel algorithm is the very large number 
of neuron interconnections on the map. 

At the end of the learning phase all processing elements have converged to 
specific values, i.e. adapted reference vectors. The resulting map can be divided 
into n layers, which represent the components of the input-data vectors. These 
layers are often presented as a grey-scale image (Fig. 7.15), in order to visualize 
the mapped distribution of a specific component from the input-data space. 

This self-organized network maps n-dimensional input patterns onto a two- 
dimensional plane (Fig. 7.16). The process of self-organization relates similar 
input-data, vectors to network elements that are close to each, other. This 
topology-conserving effect enables us to analyze component correlations by 
comparing the different map layers. The self-organizing maps are today used 
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for data analysis, classification, and prediction tasks. On a trial base they are 
being applied for microelectronic applications in process control and as design 
aids. The ability to detect parameter correlations also seems to be a promising 
approach for the qualit}^ assurance of semiconductor devices. 

An important goal of nanoelectronics is the realization of local processing, 
where data is only exchanged between neighboring elements. But despite the 
local processing concept, we still want a system to produce global results. 
Differential equations fulfil this need. Their computational operations are re- 
stricted to local elements but their result is strongly dependent on the bound- 
ary conditions. An example that also takes self-organization into account is 
presented next. 

The key to local processing and self-organization is given by a quantum- 
mechanical algorithm: As stated above, the difference between two information 
vectors can be treated as some kind of system energy, or a potential V. It 
reveals that a self-organization effect can be established by computing a W- 
function from a modified Schrodinger equation, and carrying out an adaption 
proportional to W 2 , see Chap. 3. By using this algorithm the input vectors are 
assigned to global minima. The important advantage of this concept is that 
not only can local minima be found, and the adaption processes are not very 
time consuming, but it works locally restricted and fully parallelized. This is 
an example of using models of solid-state physics also in system-technology 
concepts. 

The structures that are stored inside a self-organized map can be directly 
used to build nanoelectronic structures, because such networks are topology- 
preserving systems. To use this effect we have to initialize a homogeneous 
system with a given starting structure by a learning algorithm. This informa- 
tion can, for example, be stored in a nonvolatile memory device. This method 
could be extremely important for three-dimensional nanoelectronic systems, 
which cannot be produced by common lithographic structuring. The topology- 
conserving feature makes it possible to realize a given topology by using a self- 
organizing map, if the map already exists as a hardware implementation. The 
weights of processing elements that contain the structure to be implemented 
may be set by self-organization. This is how structures may be transferred 
to a self-organized system that initially was homogeneous. From these, SO Ms 
structures may be transferred to neighboring nanoelectronic systems without 
using any kind of lithographic process. 



7.3 Summary 

Soft computing offers interesting opportunities for nanoelectronics, for exam- 
ple fault tolerance, association, self-organization, local adaption, etc. Because 
these systems are modular, composed of relatively large units, they are also 
suitable for nanoelectronics. A further advantage is the possibility to build 
autonomous systems that adapt and program themselves. 
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Complex Integrated Systems and their 
Properties 



Information processing can be divided into three different layers that are re- 
flected by the following representation: The description on the mathematical 
layer, the data processing with networks that are based on ideal switching de- 
vices, data processing with an integrated system that is based on real switching 
devices. Generally, systems depend on an appropriate architecture, some of 
them have been described in the preceding chapters. 



8.1 Nanosystems as Information-Processing Machines 

8 . 1.1 Nanosystems as Functional Blocks 

Various realizations of microelectronic systems already exist. Most of them can 
be reduced to the block diagram in Fig. 8.1a. This figure outlines a system that 
comprises sensitive sensors, effective actuators at the interface to the outside 
world, low-noise amplifiers, accurate A/D and D/A converters, CPUs, and 
effective interfaces to the communication networks. 

First, the complete task and function specifications of a new 7 system have 
to be described. The function description has to be realized partially in hard- 
ware and pationally software (Fig. 8.1b). The hardware-software codesign is 
relatively complicated and needs much intuition, since the partioning is not 
easily predetermined. The algorithms have to be mapped in a safe and reliable 
fashion into the system. In the following, a closer look will be taken only at 
the hardware components. In general, the costs of the hardware components 
are low in comparison to the overall system. However, the hardware is a key 
component of the system. 

Besides safety and reliability the overall system performance is of general 
interest. To a first approximation, it can be described as information process- 
ing per unit time. Today, high performance is only attainable with integrated 
circuits, since they are fast, small, reliable, and inexpensive. 
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Fig. 8.1. Block diagram of microelectronic systems: (a) System with its interfaces 
and most important functional blocks, (b) System that has been mapped into hard- 
ware and software as well as its information and energy flow 



High system performance is the objective of system design. Figure 8.1b 
reveals the information flow from the input to the output. The accompanying 
information change will be discussed in the next section. Energy is needed for 
information processing. Thus, there exists an energy flow 7 besides the already- 
mentioned information flow. The energy has to be supplied as electric energy 
at the input, while it has to be eliminated as heat at the output. 



8.1.2 Information Processing as Information Modification 

According to the model, the system transforms the information cube to a 
usually smaller information cube (Fig. 8.2). From the viewpoint of system 
engineering the following questions arise: What is the minimum of devices 
needed to process input data into output data? What is the minimum energy 
needed to gain a higher degree of order and to simplify the data [40]? 

The NAND operation (Fig. 8.3a) comprises four input combinations (2 
bit) whereas the output has only two output states. The states of the output 
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Fig. 8.2. Information cube of information processing systems: Degree of parallelism, 
performance, and complexity 



are not uniformly distributed (0.8 bit) and it is impossible to reconstruct the 
input data from the output data. Thus, information gets lost. In this example 
information is represented by energy, which has to be dissipated when it comes 
to information destruction. From another point of view, information loss is 
another form to tidy up data. Mead and Conway [1] call this from of entropy 
modification ‘logic’ entropy. It amounts to the NAND operation to 1.2 bit. 

Besides the ‘logic’ entropy, information processing also has to deal with 
the so-called ‘spatial’ entropy. It describes information at its wrong location. 
In this context, Fig. 8.3b gives an example: A memory with eight memory 
positions should store two independent bits. This results in 28 possibilities. If 
the two bits are combined together and form a word the number of possibilities 
declines to four. This effective combination reduces the ‘spatial’ entropy by 
2.8 bit. 

In both cases energy is needed for the information reduction. Today, it is 
not yet possible to link this information change to technological data in an 
effective manner. The entropy modification AH accounts for volume changes 
of the information cube and is generally called negentropy AH. According to 
the thermodynamic model and the information AH, the minimum switching 
energy that is needed to process a single bit amounts to: 

IT = kTln2AH. (8.1) 

This is the minimum energy value. However, higher energy values are 
needed for safe operation. It has to be assured that faults caused by ther- 
mal fluctuations can be compensated by usual fault-correction mechanisms. 
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Fig. 8.3. Entropy modification in terms of information processing: (a) Logic entropy 
change of the NAND operation, (b) Spatial entropy change of a memory 



Thus, the minimum switching energy Ws amounts to 50 kT as has already 
been described in Chap. 3. 



W = 50 kTln2AH. (8.2) 

The values of today’s microelectronics exceed this value by orders of mag- 
nitudes. Nevertheless, concepts are under investigation that do not transform 
the energy of the lost information into heat. For instance, the conservative 
Fredkin gate transports the lost information to the outside and eliminates the 
heat, e.g. via an external resistance or via an antenna. 



8.2 System Design and its Interfaces 

The history of electronics reveals that each technological phase has brought a 
revolution on the system-engineering level. For instance, large-scale-integration 
(LSI) came up with the integrated circuits like transistor-transistor-logic 
(TTL) and emitter-coupled-logic (ECL), whereas microprocessor units ap- 
peared with the very-large-scale-integration (VLSI). Therefore, the transition 
from millions of integrated devices to billions of devices will probably result in 
a new system-engineering level. Methods for a global performance classifica- 
tion of nanosystems are needed. Their values have to be linked to technological 
data. 

Nanoelectronics has to offer considerably more than the already existing 
silicon technology in order to be successful. Thus, billions of integrated devices 
are needed. New design methods have to be developed for the overall design 
that starts from the single device and ends with complex systems. Technology- 
invariant interfaces have been introduced to micro- and nanoelectronics. 
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Fig. 8.4. Since 1980, the development of microelectronics profits from the separation 
of system-engineering level and process technology, thanks to Mead and Conway [1] 



A considerable milestone of microelectronics was introduced in 1980, when 
Mead and Conway separated technology from circuit design (Fig. 8.4). In this 
context, the introduction of design rules is of fundamental relevance: geometric 
design rules describe the lateral dimensions of the devices. Electric design 
rules reveal for instance, the resistance and the capacitance of the structures, 
whereas the technological design rules describe the dimensions and the doping 
of the structures. The separation of the system-engineering level from the 
process technology via technology-invariant interfaces has formed the basis for 
the extraordinary fast development and widespread use of integrated systems. 

The compiler is a very important interface between processor hardware and 
software. This interface has been adopted from computer science to the world 
of microprocessors. Therefore, the difficult programming of a microsystem via 
machine code (MC) is obsolete. 

What does a nanoelectronic interface look like? The first interface is lo- 
cated between the different technologies and the basic circuitries, similar to 
microelectronics (Fig. 8.5). A second interface between the basic circuitries 
and the hardware system is conceivable. Further, a third interface exists be- 
tween the system and the outside w r orld that concerns the hardware-software 
codesign (Fig. 8.6). For this task the designer living in the heaven of soft- 
ware has to bridge down to the technologists who are working in the hell of 
physics. Probably, on the long term, systems will be programmed by their 
outside world, i.e. systems will be adaptive and autonomous. 
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Fig. 8.6. In terms of complex systems, hardware and software have to be codesigned. 
The system demands a reliable hardware base (de Man) 



Additionally, technology-invariant system interfaces exist besides the al- 
ready mentioned design interfaces. For instance, memory applications can be 
based on different technologies. Thus, a hybrid technology can comprise, on a 
single substrate, like silicon, different technologies (Fig. 2.15). 
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8.3 Evolutionary Hardware 

Another interesting approach investigates hardware concepts that develop 
with their hardware resources and demands in an autonomous fashion. This is 
another type of biology-inspired concept. For instance, cells in a nutrient solu- 
tion form the requested network. This is only possible since each cell comprises 
the overall structure (see DNA in Fig. 5.3). 
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Fig. 8.7. Concept of an evolutionary hardware on the basis of FPGAs 



Research focuses on both evolutionary hardware concepts and their real- 
ization. One has to differentiate between off-line evolution and on-line evolu- 
tion. In terms of off-line evolution, the structuring process is simulated on a 
mainframe computer before it is mapped into the hardware. On the contrary, 
the on-line evolution takes place on the hardware level. Because of the lack 
of adequate hardware, up to now, research makes use of field programmable 
gate arrays (FPGAs) (Fig. 8.7, Pierre Marchall, CSEM [41]). The evolutionary 
hardware units rely on an appropriate wiring structure and activate adjacent 
units to solve complex tasks. Once again, the overall structure has to be stored 
in each individual cell. 
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8.4 Requirements of Nanosystems 

Figure 8.8 deals with architectures of some nanoelect ronic systems and de- 
scribes their implementations. From today’s point of view the following ap- 
plications might be realized in future: Demanding process control, strategic 
modeling, autonomous systems, self- assembling systems. This means more 
intelligence must be integrated into the systems via increasing numbers of 
devices. This is challenging for nanoelectronics. 

In terms of system architecture, the data-processing performance, the de- 
gree of parallel data processing, the power consumption, and wiring con- 
straints are of major concern. In particular, in terms of image-processing appli- 
cations, interesting approaches with distributed processor structures already 
exist. Another promising approach comes from the fields of softcomputing, 
since artifical neuronal networks lead to new concepts. 




Fig. 8.8. Integration of system architecture in the field of nanoelectronics, according 
to T. Fountain (UCL) 



Fault tolerance is an important issue when it comes to systems that com- 
prise such a huge number of devices. In this context, the following concepts 
already exist: Modular redundancy, free reconfiguration, recurrence, data re- 
dundancy, and fault-tolerant programming. In particular, the last-mentioned 
concept is of fundamental relevance for today’s microcomputers, but it would 
be inadequate for nanoelectronics. The concrete hardware realization needs 
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new concepts on the processor and memory level as well as a new data- 
processing fashion that might be based, for example on fuzzy sets. 

The system-engineering level has to deal with the challenging implemen- 
tation of the huge number of devices. The high complexity has to be cut down 
in some kind of functional blocks. From the architectural point of view, regu- 
lar and modular arrangements are advantageous. Obviously, the high packing 
densities require a very low power dissipation. Local processing and sparse 
interaction is attractive, since wiring is a critical issue in. relation to the high 
complexity. A global wiring method is almost not realistic. The similar state- 
ments are true for the linkage to the outside world: The number of pins to 
the outside is limited as well. Thus, an autonomous system should be ideal. 
From the already-mentioned system properties, further natural demands arise: 
The system should be self-testable and fault tolerant. Additionally, the sys- 
tem should be adaptive, self-optimizing in order to autoadapt itself to the 
environment. In a more general term one can state: Local processing should 
conclude in a global effort. 
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Fig. 8.9. The design of today’s complex systems comprises a large number of special 
fields, e.g. the SoC of a mobile communication system, according to de Man [42] 



However, the design of complex systems comprises many areas as Fig. 8.9 
shows for the example of communication systems on a chip. This challenge 
has to be met during education or on-the-job training. 

The system concepts of nanoelectronics are at their early stage, they are 
still under investigation. Therefore, it is impossible to deliver complete archi- 
tectures, merely visions. 
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8.5 Summary 

Information processing of nanosystems can be described as a modification 
of the information cube. The accompanying information loss contributes to 
the total power dissipation of the system. However, the major part of the 
dissipated energy results from the power supply of the switching devices. 

Technology-invariant interfaces will be of fundamental relevance for the 
design of nanosystems. In contrast to microelectronics, the interfaces will be 
shifted in the context of the engineering level towards higher complexities. 
Thus, technology-invariant interfaces have to be introduced between the func- 
tional blocks and the overall system in order to integrate different kinds of 
technologies. 
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Up to now we considered information processing as an abstract procedure. 
However, the objective is to implement the concept of an information-processing 
system into real chips. For the implementation we need an appropriate tech- 
nology that should be right for mass production. 

Technical information processing is based on real physical structures: 
Information-processing machines primarily comprise switches and wiring struc- 
tures. In this context we will show some examples of the classic semiconductor 
technology. Additionally, we will discuss the differences between an ideal and 
a real switch, as well as ideal and real wiring. 




Fig. 9.1. Abacus with discrete atoms, IBM [43] 
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9.1 Switches and Wiring 

9 . 1.1 Ideal and Real Switches 

What characteristics does an ideal switch have? In terms of the abacus it 
might be a tag that moves without friction on its bar, Fig. 9.1. In terms of 
today’s computers an ideal switch has an infinitely high resistance in its open 
state and an infinitely low resistance in its closed state (Fig. 9.2a). 

Nevertheless, a real switch can not provide bv any means this high resis- 
tance ratio: In practice the two binary states are approximated by relatively 
high and relatively low resistances. Moreover, these resistances are not con- 
stant, since they scatter around mean values. In Fig. 9.2b the shaded area re- 
veals this scattering. Furthermore, a real switch consumes during each switch- 
ing event the energy Ws- Only after consuming Ws does the switch react with 
a delay of td (Fig. 9.2c). 

In terms of thermodynamics the minimum energy for the processing of 
1 bit derives from the above considerations in connection with (8.1) and 
S - W/T: 

W - 2kTln2. (9.1) 

Even so, this value is not an absolute limit. For instance, the quantum 
computer can evaluate data without any loss of energy. In general, it is im- 




Fig. 9.2. Properties of an ideal and a real switch: (a) ideal switch with two states, (b) 
real switch with finite resistance that scatters around its mean value, each switching- 
event consumes the energy Ws that also scatters around its mean value, (c) the real 
switch reacts with a delay of td after consuming the switching energy Ws, (d) the 
current/voltage characteristics of a real switch might be linear to some extent so it 
can be exploited for amplification purposes 
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Fig. 9.3. Switching energy Ws in the Fermi distribution and in relation to the 
particles’ energy. On the right-hand side the single particle above Ws could flip a 
switch because of its thermal energy 



port ant to know that each switching event in the classical sense or on the 
macroscopic level depends on a certain amount of energy. 

The interference of thermal fluctuations with the switching processes ac- 
counts for the energy distribution of the particles: It depends on the prob- 
ability of particles that have a higher energy than the switching energy Ws 
(Fig. 9.3). It follows from the (3.30): 

P = e~^. (9.2) 

The value P in (9.2) is called the Boltzmann factor. Even for a very high 
value of Ws the probability for errors does not decline to zero, since at all times 
very few particles might have sufficient energy to overcome Ws which may 
cause errors. As will be derived in Chap. 15 technical systems use switching 
energies of at least 50 kT to be on the safe side. 

This amount of switching energy does not automatically contribute to the 
dissipated electric power during switching. One half of a capacitor’s energy is 
lost by its ohmic resistance during an instant discharge. This is also true for 
the charging of a capacitor, 
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(9.3) 



On the other hand, if the capacitor is charged in a slow way with many 
intermediate steps AV the dissipated power reduces to: 
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Fig. 9.4. Variety of nanoelectronic switching elements, M. Forshaw 



Wv = ^v\ CAV2 = \ CVAV - (9- 4 ) 

This specific switching process is called adiabatic switching and may be 
relevant to low-power circuits [44]. 

The restrictions of a real switch, cause several disadvantages within electric 
circuitries. The drift of resistance values should not cause any error, which is a 
critical issue especially for analog circuits. The delay times should not interfere 
with the signal-processing sequences: As a rule, the correct timing is ensured 
via external clocking. 

The characteristic values of swatches and gates, namely the delay time td 
and the dissipated power Py evolve from the behavior of the real switch. The 
dissipated power refers to the power that is consumed during the switching 
event. Both values form the power-delay diagram and should be kept as small 
as possible. 

According to Keyes the output of a switching element has to be isolated 
from its input in a suitable manner [45] . The three-terminal transistor complies 
this constrain whereas the two-terminal tunneling diode does not. Therefore, 
up to now two-terminal devices remain relatively insignificant. 

Figure 9.4 summarizes the landscape of nanoelectronic-oriented technolo- 
gies. The following chapters wall discuss them in detail. A steady movement 
towards smaller structures keeps device shrinking alive. Besides the high pack- 
ing density future technologies have to fulfil several criteria: Circuit design 
should be easy to handle, the yield should be very high, the supply voltage 
should be below 1 V, the circuits should have a high noise immunity, the 
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quiescent power consumption should be practically zero, and the technology 
should be suitable for mass production. 

However, microminiaturization meets its limits within the classic switching 
devices: Therefore new principles that are based on quantum effects have to 
be developed for future nanometric switching devices. These devices will not 
only have a high packing density, but probably will also show better electrical 
properties. From today’s point of view, molecular electronics that reaches up 
to the physical limits will become feasible. 

Dealing with such large numbers of switching devices as in nanoelectronics, 
one has to expect defective devices within the circuitry. The only solution for 
this problem is that redundant devices will have to replace them. At this 
solution, additional switching devices are not problematic, but the complex 
wiring for the redundant part will be. 



9.1.2 Ideal and Real Wiring 

System engineering reveals that not only advanced switching devices are 
needed, but also effective wiring structures. The properties of ideal wiring 
can be easily defined: First, it should be feasible to connect all nodes with 
each other. Secondly, the transmission properties should be ideal, i.e. no resis- 
tances, capacitances, inductances, so that the speed of light defines the time 
delay of all connections. 

Today’s microelectronics exclusively makes use of two-dimensional cir- 
cuitries that are based on multilayer wiring, in order to save area and to 
keep timing delays short. Recent technologies have to deal with high current 
densities that tend to cause electromigration, which can damage the wiring. 
Copper wires instead of aluminum wires can overcome this problem. Cryo- 
genic conduction can reduce the wiring resistance to approximately zero, but 
comes with several other problems. Capacitances can not be omitted as well 
as inductances. However, in many cases inductances can be neglected. Both 
items define the characteristic wave impedance that is in the order of a few 
tens of ohms. The characteristic wave impedance itself determines the lower 
signal levels. 

Several approaches have already been attempted for real three-dimensional 
circuit integration. Intelligent sensors are based on multilayer designs with 
integrated transistors and complex wiring structures. Other approaches build 
stacks of single chips. In this case, the interconnection facilities are limited, 
since wires can only be attached from the outside. 

In the fields of nanoelectronics some new approaches already exist. M„ 
Forshaw (UCL) for example, suggests thin silicon chips that might be in- 
terconnected with conducting polymers. The silicon substrates have to have 
perpendicular conducting channels so that they can be interconnected through 
several layers. The schematic view of Fig. 9.5, for example, presents a concept 
for an electronic eye. 
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Fig. 9.5. Layer structure of the electronic eye. The photosensitive sensors and the 
interconnects are realized with molecular electronics, whereas the circuit itself is 
based on silicon, proposal of M. Forshaw, UCL 



An electronic eye implements the most important features of an eye into 
an integrated system. Because of its complexity it is a challenge for nanoelec- 
tronics, and it is an important subsystem for robotics and traffic engineering. 
Today, electronic eyes are implemented as a two-dimensional system in silicon: 
The signal preprocessing unit is integrated beneath the photodiodes and all 
signals have to leave the circuitry in a lateral manner. This is less problematic 
for three-dimensional arrangements. 

The arrangement in Fig. 9.5 is of general interest for information-processing 
purposes. Two-dimensional information is fed into a processing layer struc- 
ture. On the long term it seems to be feasible for neuronal networks to change 
to three-dimensional architectures that would offer the opportunity of three- 
dimensional signal processing. In this context, interconnects should be dy- 
namically configurable (Chap. 4). 

Another interesting network approach firstly marks the points that should 
be connected with DNA molecules. Then, self-organizing DNA strands set up 
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Fig. 9.6. DNA molecules for wiring purposes, Storm, de Vries, Dekker, TU Delft, 
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Fig. 9.7. Clock distribution via, light: Light pulses control photoresistors, R.A. Soref 



the interconnections between the marks. The DNA network might be coated 
with metal in order to enhance the electric conductance (Fig. 9.6). Other 
promising wiring concepts are based on light beams [46]. Below, two examples 
will be given. 

Beside these two relatively complex examples, a straightforward solution 
mounts a laser diode into the chip housing. Light flashes control photoresistors 
that discharge integrated wares (Fig. 9.7). The charge itself controls transistors 
and results in electric pulses that clock the system. 

Figure 9.8a shows a reconfigurable optical network. Depending on the ap- 
plied electric fields, the input signals appear at the corresponding outputs. An 
optical connection network between two silicon chips can be realized with a 
reflection hologram. The input consists of integrated laser diodes that expose 
light on a reflection hologram. From there, the light is projected onto the pho- 
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Fig. 9.8. Optical flexible wiring: (a) controllable lens system, (b) hologram 




Fig. 9.9. Classification of different concepts in terms of parallelism, wiring and data 
transfer, fiP: microprocessor, AP: array processors, ANN: artificial network, CNN: 
cellular neuronal network, NS: objective of nanoelectronic systems 



todiodes of the second chips (Fig. 9.8b). This leads to a multiple connection 
network between both chips. 

The importance of wiring has already been discussed in Chap. 7 in connec- 
tion with the properties of complex systems. Figure 9.9 reveals the different 
system concepts in terms of parallelism, wiring, and data transfer. The mi- 
croprocessor is included in the three-dimensional diagram as a benchmark. 
The wiring complexity is relatively high for both array processors (AP) and 
artifical neuronal networks (ANN). It is relatively simple for cellular neuronal 
networks. They only serve for simple and mostly very specific tasks. The ob- 
jective of nanoelectronic systems is to establish high parallelism and global 
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Fig. 9.10. Examples of classic solid-state swatches: (a) MOS transistor, (b) bipolar 
transistor, (c) thyristor, (d) tunneling diode 



data transfer on the basis of a local wiring network. These constraints have 
to be met by future system concepts that are still under investigation. 

According to M. Forshaw 7 the following theses can be claimed for the re- 
dundancy of nanosystems: A compact two-dimensional arrangement can only 
include a few redundant cells, which results in an illdefined nanoelectronic sys- 
tem. For certain, a widespread arrangement can keep more redundant cells, 
but will be ineffective because of the widespread wiring. The only remaining 
solution for an adequate high-performance system is a three-dimensional in- 
tegration that offers convenient wiring conditions for all cells, especially for 
the redundant ones. Nevertheless, up to now^ only two-dimensional CAD tools 
exist so that three-dimensional tools still have to be developed and they will 
be quite expensive. 



9.2 Classic Integrated Switches and their Basic Circuits 

9.2.1 Example of a Classic Switch: The Transistor 

The main purpose of electronics is to control the current flow. Electrome- 
chanical switches like relays formed the starting point and have not been re- 
placed in all areas by solid-state switches, because of their enhanced electrical 
properties. Microsystems are based on solid-state switches so that mechanical 
switches have nearly no significance for them. For instants, Fig. 9.10 reveals 
the cross sections of four classic solid-state switches [47]. 

The MOS transistor consists of a p-doped region and a metallic gate that 
are separated by a dielectric material (Fig. 9.10a). A positive voltage causes, 
in connection with the field effect, a conducting electron channel at the surface 
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of the semiconductor. The conducting channel connects both n-doped regions 
of the transistor in the electrical sense. Without an applied gate voltage both 
regions are insulated from each other. Thus, the gate voltage opens and closes 
the switch, the switching process is at the surface of the chip. 

Figure 9.10b depicts the schematic view of a bipolar transistor. It consists 
of three different doped areas. The base current Ib flows in the forward direc- 
tion of the emitter-base diode. The relatively high emitter current does not 
leave the transistor at the base side, but flows mainly to the collector side, 
since only very few carriers recombine in the thin base. The emitter injects 
carriers to the collector. Without a base current there is no collector current. 
Therefore, the bipolar transistor is a switching device controlled by a current. 
Its switching process takes place in the volume of the chip. 

A four-layer npnp-structure forms the thyristor switch (Fig. 9.10c). A small 
current between the inner layers puts the thyristor into the conducting state. 
Thyristors belong to the group of power-electronic devices, however, micro- 
electronics also makes use of them: Chap. 2 describes how thyristors regen- 
erate light signals at fiber optics. Light beams can switch thyristors that, for 
instance, drive LEDs. The thyristor is advantageous since it connects fully 
through and delivers the complete output signal. This layer structure also ap- 
pears as a parasitic device within CMOS technology. The so-called latch-up 
emerges between a well and its neighboring transistors and can destroy the 
chip due to the high current by a short-circuit. 

A highly doped pn-j unction makes a tunneling diode. In general, it is called 
an Esaki diode (Fig. 9.KM and Fig. 12.7). The applied voltage controls the 
current of the tunneling diode. The control depends on two effects: First, the 
electric field deforms the potential barrier, which reduces its effective thick- 
ness. Secondly, the band edges move towards each other due to the applied 
voltage. The carries can only tunnel if the other barrier side offers a free band. 
Obviously, the position of bands has a strong impact on the current through 
the diode. Tunneling devices will probably have a strong impact on nanoelec- 
tronics, since the tunneling takes place very rapidly and without any loss of 
energ}^. 

Since the tunneling diode is a two-terminal device the enhanced switching 
properties suffer from the fact that no input-output insulation is feasible. New 
device structures avoid this drawback, for example, by combining two serially 
connected tunneling diodes. A control electrode takes care of the inner poten- 
tial voltage of the center. In this structure the resonant tunneling transistor 
is a promising candidate. 



9.2.2 Conventional Basic Circuits 

Circuit-design techniques for binary logic circuits will be taken for granted and 
only their basic principles will be reviewed in the following. The first semi- 
conductor circuits based on logic circuits had been already used by Zuse for 
his relay-logic (Fig. 9.11a). Instead of electromechanical switches, transistors 
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like the bipolar transistor (Fig. 9.11b) and field-effect transistor (Fig. 9.11c) 
had been used initially [48]. 

In some ways the transistor has poor properties in comparison to a me- 
chanical switch. For instance, gates have to deal with the problem of residual 
voltage: The active transistor causes a current that results in a voltage drop 
across the transistor. This voltage drop has to be smaller than the threshold 
voltage of the following transistor stage, otherwise the subsequent stage would 
always be activated [49]. 




Fig. 9.11. Different NAND gate realizations: (a) electromechanical switch (relays), 
(b) bipolar transistor, (c) MOS transistor 



A simple response to this problem could be a relatively high load resistance 
Rl- But this solution is disadvantageous for the charging time of the output 
capacitance Cl that gets charged via Rl . Besides the problematic residual 
voltage, one has to deal with the high power dissipation. 

Two possible solutions of this dilemma might be the following: The first 
solution passes from the static to the dynamic circuit design, technique. Thus, 
the output capacitance is only (dis-) charged if the transistor in the drive or 
load branch is activated (Fig. 9.12a). The charge on the capacitance Cl repre- 
sents the logic state of the gate. Due to charge leakage, such dynamic circuits 
depend on a minimum clock frequency so that the charges get regenerated 
before it completely disappears. 

The second solution is based on the relatively complex CMOS technology 
with its complementary transistors (Fig. 9.12b). The quiescent current of this 
circuit technique can be considered to be zero for all practical purposes. Both 
the charging and discharging of the capacitance Cl is done by low-resistance 
switches. Due to these outstanding properties, this technique is presently the 
most popular one. 

Another alternative is based on transfer of electric charge. In this context 
the field effect transistor is a very suitable device, due to its symmetric be- 
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Fig. 9.12. NAND gate with relatively low static power consumption and without 
residual voltage: (a) dynamic gate, (b) CMOS gate 



havior (Fig. 9.13). Both the drain and the source contact can serve as a signal 
input. Together with an appropriate clock supply, relatively complex gates 
can be realized with a reduced number of devices (Fig. 9.13b). A special type 
of this circuitry are the so-called switched capacitor filters. 
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Fig. 9.13. Controlled logic: (a) MOS transistor as switch and transfer element, (b) 
OR gate composed of transfer elements, (c) OR, gate on the basis of charge coupled 
devices (CCD) 



A more advanced approach gives up the transistor structures and utilizes 
charge coupled devices (CCDs). Within the CCD structures charge is moved, 
brought together, and generated at the surface of the substrate (Fig. 9.13c). 

Up to now, preferred design techniques are based on voltage-mode cir- 
cuitry. However current-inode circuitries are also an interesting approach that 
might be advantageous for future low- volt age applications. 
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9.2.3 Threshold Gates 

Threshold gates are promising concepts that are probably suitable for nano- 
electronics [50]. Threshold logic and its application possibilities had already 
been investigated before. It turned out that binary techniques are more effec- 
tive. However, this strategy might change when it comes to nanoelectronics. 




Threshold gates can evaluate any Boolean operation. For instance, Fig. 9.15 
shows the threshold values for a logic OR gate and a logic AND gate, respec- 
tively. With respect to the threshold gate in Fig. 9.14 these two logic gates 
have only two inputs, 

V(X) = sign (X - 0) = { J < 0 ( 9 - 5 ) 

N 

X = '}2 Wk ' Xk = 4 

k = 1 

W k :::::: {0, ±1, . . . , ±W 

max } 

0 = { 0 , ± 1 , . . . , ±0max} • 

Threshold gates are very suitable for full-adder designs (Fig. 9.16), due to 
their reduced logic depth and high parallelism. The circuit in Fig. 9.17 makes 
extensive use of these advantages. In this context, Chap. 12 reveals an elegant 
realization. 

The bigger the adder, the more threshold values are necessary. The range 
of values of the above example goes from 1 to 9. Since it is difficult to de- 
fine these threshold values within today’s technologies, they do not have any 
significance. In terms of nanoelectronics the number of threshold values au- 
tomatically corresponds to the number of switching devices, which solves the 
problem of the lack of precision. The longing for higher clock speeds and 
shorter cycle times in connection with higher integrated CPUs require robust 
gates that evaluate their operations within a few steps. All these claims can 
be met by threshold gates, in spite of their known drawbacks. 
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Fig. 9.15. Threshold values: threshold 0 > 1: OR gate, threshold 0 > 2: AND 
gate 



2,3 1 3 Counter 3|2 Counter 




Fig. 9.16. Examples of a one-bit full adder: Both solutions have a logic depth of 
two. The range of threshold values increases with the logic width of the adder 



In contrast to systolic system configurations, systolic arrangements on the 
bit level primarily perform simple operations. Usually, one cell receives several 
one-bit inputs from its neighbor cells. This information is processed on a low 
level. The critical path of a single cell is in the boundaries of two to three 
logic gates. On the edge of each cell several D-latches are responsible for 
synchronization purposes. This concept omits long wiring distances and is 
usually applied to pipelined adders and multipliers. 

Figure 9.17a shows two systolic 8-bit ripple carry adders. Input data and 
the sum bits are propagated in the vertical direction, whereas the carry bits 
propagate in a diagonal fashion. The adder in Fig. 9.17b reveals how the delay 
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Fig. 9. IS. Principle of the Fredkin gate: (a) Example of the Fredtkin gate, (h) 
realization 



time can be reduced by parallelizing the sum operation of two bits. Threshold 
gates proved to be suitable for these operations. 

9.2.4 Fredkin Gate 

Obviously, a conventional AND gate suffers from the information loss. It is 
not possible to reconstruct the input signals from the output signal. According 
to Chap. 8 (Fig. 8.3) the loss of information accompanies power dissipation. 
Therefore, the questions arise if it is feasible to build logic gates that do not 
suffer from information loss and if data processing always accompanies power 
dissipation. Actually, there is a gate family that processes information without 
the loss of energy: The so-called Fredkin gate. 

The Fredkin gate interlinks the input data that appear at the output in 
varied manners (Fig. 9.18a). The basic idea is to forward only the information 
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that is needed in a following stage. The unused information is collected as 
‘waste’ at a bus line that also serves as a terminal resistance. Therefore the 
Fredtkin logic remains, in its inside, cool. Within the circuit no power dissipa- 
tion appears in regard to information loss. Figure 9.18b reveals, for instance, 
an AND operation. 
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Fig. 9.19. Interaction gate: (a) schematic view, (b) Realization with balls on the 
basis of the elastic collision. According to Drexler, such arrangements are feasible in 
terms of nanomechanics 



Figure 9.19 illustrates an interaction gate on the basis of a Fredkin gate 
with balls. The elastic collision does not cause any power dissipation. The 
information garbage has to be collected and led to an output. This form of 
information processing is reversible. 
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Fig. 9.20. Typical Fredkin gate configurations: (a) A binary conservative logic gate, 
(b) A XOR gate from Fredkin gates with the garbage lines gl/g3 and the data lines 
0 and 1, (c) A unary conservative logic gate, (d) interaction gate 



At first sight, this concept seems to have only theoretical importance. 
However, it turned out to be very suitable for molecular- and nanoelectronics. 
The Fredkin gate logic prevents the heating of molecules during information 
processing. This point is an important issue, since it is very difficult to cool 
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molecular structures. Information waste is led to the edge of the gate and 
cannot heat the molecule. In terms of polymer electronics, electrons do not 
emit their energy to the molecule or its surrounding, but take it to the outside 
to the waste-bus. 

In general, it is challenging to investigate new concepts in electronics that 
exploit these principles that might lead to advantageous circuitries. 



9.3 Summary 

An excellent integration technique depends on both good switching devices 
and good interconnection structures. For many decades the same principles of 
solid-state switches, like the field effect transistor and bipolar transistor have 
been improved. However, in recent years new circuitry concepts have not been 
put into practice. Therefore threshold logic lives in the shadows. The switches 
themselves are not important for the application, but their basic circuitries are. 
As a rule, a better power-delay product results in higher hardware expenses. 
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Quantum Electronics 



Traditionally, microelectronics is based on scaling towards smaller structures. 
When the dimensions of these structures reach the nanometric scale, micro- 
electronics has already transformed to nanoelectronics. Future systems that 
are composed of these nanometric structures require both novel architectures 
and modern switching devices [51]. The behavior of these new devices can pri- 
marily only be explained with quantum-mechanical models. The second half 
of this chapter deals with some important nanoelectronic switching devices. 



10.1 Quantum Electronic Devices (QED) 

10.1.1 Upcoming Electronic Devices 



In the classical sense, a semiconductor switch depends on electrons that can be 
characterized by their elementary charge q. This simplified model is becoming 
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Fig. 10 . 1 . Charge storage: (a) liomostructure, (b) heterostructure and (c) molecule 
(simplified). The switching event of the MOS transistor is characterized by many 
electrons, whereas other devices only change the state of a single electron 
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technically obsolete when it comes to nanoelectronics. Nanoelectronics has a 
much higher packing density than conventional FET electronics, thus it is 
most important to work towards new architecture concepts. At present, the 
minimum feature size of integrated circuits is ruled by the delocalized charge 
storage. Its relatively high extension depends, amongst other things, on the 
Debye Length (1.2) and is in the range of almost 100 nm (Fig. 10.1). However, 
within the boundaries of classical physics local charge storage as is known, 
e.g. from molecules, only requires few nanometers. 
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Fig. 10.2. Some I-V characteristics of quantum electronic devices (QED) 



These dimensions can be reached in terms of very thin semiconductor and 
insulator layers. Devices that have at least one dimension in the order of 
magnitude of the electron wavelength are called mesoscopic elements. One of 
these elements is the quantum well with its characteristic thin semiconductor 
layer. Another example is the tunneling structure with its typical narrow 
insulator layer. With respect to quantum-mechanical devices, the behavior of 
electrons in mesoscopic structures will be analyzed in the following. 

Both mesoscopic and quantum-effect devices are featured with enhanced 
properties. The single-electron-transistor (SET) emerges from the MOS tran- 
sistor if it is scaled down and the two pn-j unctions are replaced by tunneling 
elements (TE). Within the SET the charge of one electron controls the current 
of single electrons. In a similar way flux-quantum devices control the num- 
ber of magnetic flux quanta in a ring structure via two tunneling elements. 
The dimensions of such a configuration need not be in the nanometric range, 
which may be advantageous. The focus of the next chapters lies on these 
technologies. 

A quite new approach is based on regular quantum-dot structures in which 
single electrons are moved locally. In the literature they are described as quan- 
tum cellular automata (QCA). Data is not propagated in the form of charge 
along metallic wires, but transformed by information patterns. The dimen- 
sions of the quantum dots are in the domain of a few nanometers, which 
results in very high packing densities and leads to a real nanoelectronic con- 
cept. Such a concept can also be realized with magnetic flux quanta, which is 
of fundamental interest. Although its realization is in the distant future, we 
will take a closer look at it in the following. 
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Quantum-effect devices show enhanced I-V characteristics. The typical 
inverter characteristics of the MOS technology can also be found within 
quantum-dot devices. In many cases the current of quantum-effect devices 
is a staircase function of the voltage. The grid of the stair function depends 
directly on physical quantities (Fig. 10.2). 

As a general rule, devices with tunneling elements (TE) has I-V charac- 
teristics with, a negative differential resistance (NDR) segment. This regime 
can be used for information storage and processing purposes. Several NDR 
regimes can be achieved with appropriate band structures that might be used 
for functional integrated gates. 

The same circuit architectures that are typically used for conventional cir- 
cuit designs can also be utilized for circuit designs that are based on quantum- 
effect devices. However, more appropriate architectures that directly benefit 
from the enhanced device characteristics have to be taken into account, e.g. 
for multivalued logic applications. 

Nevertheless, an interface has to be established between any new archi- 
tecture and the conventional Boolean algebra. In this connection, universal 
gates that can be programmed by a control signal are advantageous. A com- 
prehensive example is the majority gate. Its output depends on the majority 
value of all input values. It is an interesting gate for fault-tolerant systems. 
Additionally, it can also be considered as a universal gate: If the control signal 
is connected to the logic low level its logic function is AND. On the contrary, 
if the control signal is connected to the logic high level, its logic function is 
OR (Fig. 10.3). 

Other approaches take advantage of the material wavefunction of the elec- 
tron, which also leads to new switching elements. Below, two of these ap- 
proaches are discussed in detail. 

10.1.2 Electrons in Mesoscopic Structures 

The band diagram of a semiconductor depends on its crystal structure and on 
the material wavefunction of the electrons [47] . The band diagram reveals the 
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Fig. 10.3. Universal gate and a complex functional block 
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Fig. 10.4. Quantum structures with different dimensions: normal solid-state body, 
quantum well, quantum wire, and quantum dot, additionally their density of states 
are illustrated. The grid of the steps depends on the size of the quantum structure 



energy levels that can be occupied. The density of states of these levels depend 
on the size of the crystal. An extensive three-dimensional body produces a 
three-dimensional k-space. The energy E of the electrons can be derived from 
solid-states physics as: 



E = ^i + k l + ^ ) = ^ i \ k f. (io.i) 

With respect to the three-dimensional k-space the possible states can be 
expressed as: 



_ V T 1 (2 m,E\ 3/2 

\HT ) 

The density of states follows as: 



( 10 . 2 ) 
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(10.3) 



The density of states is proportional to the square root of the energy of 
the electrons. This function describes the well-known Boltzmann relation of 
classical physics (Fig. 10.4). 

A three-dimensional relatively large ashlar (3D potential well) has a steady 
y/E characteristic for the density of states, whereas the density of states char- 
acteristics becomes discontinuous if at least a single dimension is scaled down 
to the domain of the material wavelength of an electron. This leads to quan- 
tum layers (2D potential well) and appears, e.g. in the inversion layer of a 
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MOS transistor. Further steps lead to the quantum wire (ID potential well) 
and quantum dot (0D potential well). 

An electron space that is limited in one direction leaves an unlimited two- 
dimensional k-space for the electron. In this case of a quantum layer the energy 
of the particle is: 



E =^( k l+ k l)+ E i- (10-4) 

Along the reduced dimension only states that match with the wavelength of 
the electrons can appear. Therefore, discrete energy levels Ei appear. Because 
of the two-dimensional k-space the density of states S is directly proportional 
to E. If this expression is derived to E, dS/dE is independent of the energy 
(Fig. 10.4). 

A further step leads to the so-called quantum wire, which leaves only 
one dimension of freedom to the electron. Consequently, the energy can be 
expressed as: 



E =^ k D+ E ™- (!0-5) 

Under these conditions the density of states is for the one-dimensional 
k-space inversely proportional to the square root of the energy. 

A restriction of all three dimensions of the electron results in the so-called 
quantum dot. Under these conditions only discrete energy levels can appear: 

E = E imm . (10.6) 

In this case the diagram of the density of states shows only discrete lines, 
similar to a hydrogen atom or a molecule. 




Fig. 10.5. Quantum dot and some of its states in the E(x,y) characteristics 



Such a quantum dot can be realized from small semiconductor columns 
(Fig. 10.5). In general, the lateral dimensions are adjusted so that the energy 
value of an electron is about one electron volt. The third dimension depends 
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on a very thin epitaxial layer. With respect to this dimension the electrons 
own at least twenty electron volts. This results in a clearly arranged energy 
spectrum since vertical energy levels only appear at higher energy levels. These 
arrays can be tuned to desired wavelengths and emit sufficient power for laser 
applications. 

In this connection the relatively abrupt junction of the quantum well is 
of fundamental relevance. In general, the abrupt junction depends on specific 
doping profiles of the semiconductor. They have to be essential^ smaller than 
the material wavelength of the electrons. Such abrupt junctions can be re- 
alized in GaAs via molecular beam epitaxy. Within the silicon domain it is 
very difficult to realize these abrupt junctions. Additionally, such silicon-based 
devices require relatively low temperatures so that at present GaAs has some 
advantages over silicon with respect to nanoelectronics. 



10.2 Examples of Quantum Electronic Devices 

10.2.1 Short-Channel MOS Transistor 

Since the conducting channel of a MOS transistor is very thin, its inversion 
layer can be considered as a two-dimensional electron gas in a quantum layer. 
Therefore different stationary electron waves can be traced in its cross section 

(Fig. 10.6). 

The discrete energy levels of such a quantum layer appear at low tem- 
peratures. Experimental results reveal oscillations in the I-V characteristic of 
short-channel MOS transistors (Fig. 10.7). 

These oscillations are due to quantum effects. However, careful research 
has revealed that an exact interpretation of these effects is relatively difficult. 
They can probably be explained by the Coulomb blockade, however, some 
signs are also emerging that quantum chaos may model these effects. 





Fig. 10.6. Appropriate doping and adequate dimensions transform the inversion 
layer of the channel to a quantum layer with discrete energy levels 
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Fig. 10.7. I-V characteristic of a MOS transistor with a channel length of 30 nm: 
Quantum effects appear in terms of oscillations, G. Wirth [52]. 



10.2.2 Split-Gate Transistor 

The split-gate transistor is an interesting example of mesoscopic systems 
(Fig. 10.8). The small dimensions of the gate slit result in different wave con- 
figurations that have a direct impact on the conductance between the source 
and drain contacts. The gate voltage alters the shape of the potential well. 
Mainly, the depth of the potential well is modified so that more eigenstates can 
be occupied. Consequently, the conductivity between S and D increases. This 
increase does not take place in a continuous way, but is a staircase function 
of the gate voltage. It is of fundamental interest that the step of the staircase 
function only depends on physical quantities, namely the elementary charge 
q and the elementary quantum of action h. Therefore, parameter fluctuations 
that, in general, are problems for sub -/im circuit designers are not an issue 
for the split-gate transistor. 

The grid of the staircase function can be derived heuristically from the 
uncertainty principle : 



AEAt > h. (10.7) 

The energy is proportional both to the voltage V and to the charge 2q, 
because each state below the Fermi level can be occupied by two electrons. 
The time t depends on the transport time of an electron and can be expressed 
as q/I. 

With respect to the uncertainty principle, the conductance can be ex- 
pressed as: 
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which is equivalent to the step of the staircase function. 



( 10 . 8 ) 




Fig. 10.8. Mesoscopic element: Split-gate transistor with three different opera- 
tion conditions (a-c) and its GV g characteristics (d) that can be described by 
Schrodinger’s equation 



A similar behavior shows the von Klitzing structure that comprises a MOS 
transistor with four connections at the channel area. Within a magnetic field 
its conductivity characteristics also shows a staircase function because of the 
Hall effect and the two-dimensional quantum layer. 




Fig. 10.9. Voltage impact on a channel with a two-dimensional electron gas 



10.2.3 Electron- Wave Transistor 



A further progress of the previously described devices leads to structures in 
which the electron waves have an even more important impact. Figure 10.10 
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Fig. 10.10. An electron- wave transistor in a waveguide structure (a): (b) the channel 
is closed, (c) the channel is conducting 



depicts a transistor in which single electrons form a current flow along the x 
axis. 

Similar to high-frequency configurations a perpendicular stub line is con- 
nected to the channel. The principle of the device is quite simple (Fig. 10.11): 
The effective length of the stub line can be varied via the gate voltage. An 
effective length of the stub line that is equivalent to a multiple of the half- 
wavelength results into a short cut at the tapping. However, an effective length 
of the stub line that is a multiple of the quarter wave length has no impact 
and the electrons can pass freely through the channel. 

The effective length of the stub line is altered by an external voltage. The 
gate electrodes of two serial-connected electron- wave transistors can serve as 
inputs for an AND gate. 

Figure 10.12 shows the micrograph of a q-s witch. The electron waves travel 
from the source contact, to the left or right drain contact depending on the 
gate voltage. 

10.2.4 Electron-Spin Transistor 

Quantum electronics can also exploit the spin of a single electron. In the 
literature such a magneto-electronic approach is referred to as spintronics. 

In a kind of bipolar structure the emitter and collector consist of ferromag- 
netic layers, whereas the base is made of a semiconductor material (Fig. 10.13). 
As indicated by the arrows the magnetic layers have been magnetized by exter- 
nal magnetic fields. The spins of the electrons that enter through the emitter 
align to the magnetization of the emitter layer and travel towards the base. 
Electrons can only pass from the base to the collector if their spin aligns with 
magnetization of the collector. For a correct operation, the layer thicknesses 
have to be in the nanometric domain. All lateral dimensions can also be scaled 
down, which leads to a real nanoelect ronic device. 
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Fig. 10.11. Different operating conditions of the electron- wave transistor: (a) and 
(c) open or conducting, (b) and (d) closed or switched off 



This technology of spintronics may be very promising for memories with 
large capacities. They may replace the magnetic hard discs. 



10.2.5 Quantum Cellular Automata (QCA) 

The idea of QCAs is quite young. Its concept describes a smart way to realize 
cellular automatas in a nanotechnology. Presently, cellular arrangements of 
quantum dots count among highly innovative concepts of solid-state electron- 
ics. They are very promising, since they depend on neither metallic wiring nor 
electric currents [53]. 

The basic QCA cell consists of four quantum dots that are arranged in 
a square (Fig. 10.14). Each QCA cell comprises two electrons. They autoar- 
range in a diagonal because of their Coulomb interaction. This results in two 
different configurations that are suitable for a binary representation. Via an 
external influence it is possible to change from one configuration to the other. 
Typically, the external influence consists of an electric field, meanwhile the po- 
tential barriers have to be lowered. On the basis of this principle information 
processing will be feasible. 

Before changing the polarization of a QCA cell, the potential barriers have 
to be lowered in order to reduce the localization of the electrons and to in- 
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Fig. 10.12. Micrograph of a q- switch: Electron waves travel to the left or right 
drain contact 
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Fig. 10.13. Principle of the electron-spin transistor: A high collector current can 
only pass if the spins align with the magnetization of the collector 



crease the tunneling probability. Typically an adiabatic switching scheme is 
applied that is similar to the adiabatic charging and discharging of capaci- 
tances (Fig. 10.15). This relatively slow switching scheme is equivalent to the 
clocking scheme of logic gates and supplies energy to the QCA. During the 
switching event the localization probability of the two electrons is uniformly 
distributed on the four quantum dots. After raising again the potential barri- 
ers the desired configuration is set up with respect to the external field. 

One has to keep in mind that the two stable configurations depend on 
two electrons. These electrons have to be fed in a lateral manner into a chain 
or even grid QCA structure, which is not a trivial problem for any kind of 
technology. 

The combination of single QCA elements enables the realization of rela- 
tively complex wire and gate structures (Fig. 10.16). A simple topology exists 
out of a linear combination that might serve as an electrical wire. In this case, 
the electrons interact with their neighbor cells. In the initial state all cells have 
the same configuration. A configuration change via an external electric field of 
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Fig. 10.14. Principle of a single quantum dot 
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Fig. 10.15. Adiabatic switching events of QCAs 



the left QCA cell evokes a configuration change of all other cells. This change 
assumes that the potential barriers have been lowered before. While raising 
again the potential barriers, the cells flip into the new configuration starting 
from the left side towards the right side. So information can pass without any 
metallic wire. 
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Fig. 10.16. Examples of basic QCA elements: (a) cell, (b) wire, (c) inverter, (d) 
fanout 2, (e) majority gate 



According to this principle all basic elements of a computer can be realized. 
For example, a displacement of the cell structure results in an inverter. A T- 
shape arrangement offers a fanout of two. 




Fig. 10.17. Majority gate: Its realization with quantum dots, truth table, and its 
application as logic AND and OR gate 



A cross-shaped arrangement is equivalent to a majority gate. Its output is 
ruled by the majority value of the inputs (Fig. 10.17). A typical application 
of majority gates are fault-tolerant systems. It can also serve as a universal 
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gate, since its logic operation can be programmed. If one input is permanently 
connected to a fixed logic value the gate computes the logic OR operation for 
a permanent 1-level. For a permanent 0-level the gate performs the logic AND 
operation. 

The example in Fig. 10.18 shows a more complex QCA gate, namely a RS 
flip-flop. A relatively complex clocking scheme is necessary in order to ensure 
the right sequence of the single operations. 

At present the behavior of such quantum structures is under investigation, 
via computer simulations (Macucci, Pisa). Up to now, it turned out that the 
cells are quite sensitive and that the circuits are not very robust. Furthermore, 
errors can only be prevented if switching takes place quite slowly in order to 
guarantee the termination of the transition process. 
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Fig. 10 . 18 . QCA RS flip-flop 



Additionally, other attempts try to implement such structures as solid 
circuits. The first promising samples in silicon have been realized on top of 
an SiO ‘2 layer (Fig. 10.20). Another approach couples potential wells with 
tunneling elements. An ambitious method tries to realize the quantum-dot 
arrangement within a molecule structure (Fig. 10.19). 

QCAs are operated at low temperatures in order to suppress thermal influ- 
ences. Room-temperature operation will become feasible if the single quantum 
dot is scaled below 5 nm. Such small dimensions will make it possible to store 
and link data in cells smaller than 25 nm x 25 nm. This packing density is 
close to the technology limits of solid-state nanoelectronics. However, first the 
influences of background charges have to be controlled. 
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Fig. 10.19. Implementation of quantum-dot structures 




Fig. 10.20. Micrograph of a silicon QCA cell (University of Tubingen, Germany) 



10.2.6 Quantum-Dot Array 

A more advanced concept implements arrays of quantum dots that interact 
with their neighbors. Due to the coupling the quantum dots show an interest- 
ing dynamic behavior (Fig. 10.21). 



a) 





Fig. 10.21. Quantum-dot array: (a) technology arrangement of a quantum-dot 
array, (b) equivalent networking diagram of two coupled quantum dots with respect 
to their substrate interaction 
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Such quantum-dot arrays can be realized, e.g. with metallic islands such 
as gold clusters. They might arrange themselves in a uniform process via self- 
organization (Fig. 10.21). The dimensions of a single island are in the domain 
of a few nanometers. 
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Fig. 10.22. Self-assembling of a quantum-dot array 



This network has multiple stable states like an artificial neuronal network 
(e.g. a Hopfield network) which sets it apart from QCA networks. The islands 
can be treated as capacitances and their electrical charge can be used for 
information storage. A resistive and capacitive coupling between the islands 
is responsible for the interaction between the quantum dots. Relatively large 
islands result in a nonlinear, continuous network system that depends on the 
stored island charge. However, small islands lead to discrete energy levels due 
to the Coulomb blockade. The nonlinear behavior can be modeled by resonant 
tunneling diodes that are located between the substrate and the islands. Typ- 
ical for such quantum-dot networks are image processing at low level and as- 
sociative information-storage systems. Boolean operations have been already 
realized with, small quantum-dot networks. In principle, all problems that can 
be described with nonlinear differential equations can be solved with, these net- 
works, assuming that the correct initial and boundary conditions have been 
fed into the network. 

A further application might be the quantum computer: each dot represents 
a Qubit. If the dots are located close to each other (< 4 run) the electron states 
couple among themselves, which leads to coupled Qubits. 



10.3 Summary 

The field of quantum electronics has given rise to many interesting approaches. 
Whether they are useful or not for nanoelectronics has to be checked in the fu- 
ture. Most of these approaches are based on mesoscopic structures and exploit 
the wave properties of the electrons. 

Quantum-dot structures show interesting concepts with typical nanometric 
dimensions of less than 5 nm x 5 nm. However, for their successful operation as 
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QCAs complex wiring schemes are indispensable. Such arrays are also useful 
for quantum computing. 
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Bioelectronics and Molecular Electronics 



Nanoelectronics has various ways to go further. Presently, nanoelectronics 
moves along three main directions (Fig. 11.1): 

1. Solid-state nanoelectronics: Within very small crystal structures electrons 
are primarily ruled by their wavefunctions. Solid-state nanoelectronics fol- 
lows a top-down approach and it comes from the area of system imple- 
mentation. Such concepts have been discused in the previous chapters. 

2. The specific states of the electrons in a molecular structure have to be 
considered when it comes to molecular electronics. Molecular electronics 
is a bottom-up approach and it starts from questions like: What can be 
realized with, such basic elements? 

3. A further approach is motivated by biology. Therefore, it is called bioelec- 
tronics or it is referred to it as wet electronics. Bioelectronics is inspired by 
biology and it tries to copy certain concepts of biology. Such an approach 
has been described in Chap. 4. 



nanoelectronics molecular electronics bio electronics 

* I 

/ / 

cell membrane 

electron waves electron states wet electronics 

top-down bottom-up copy, inspire 




Fig. 11 . 1 . Rough classification of nanoelectronics 
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This chapter can only be a sort of summary of approaches to bioelectronics 
and molecular electronics. At present, developments in these fields take place 
quite rapidly so that any more complex explanation would be out of date very 
soon. Therefore, this chapter only claims to give a coarse overview on basic 
principles. 



11.1 Bioelectronics 

The membrane is a very important part of a biological cell. It can adjust 
its conductivity via ion channels over several orders of magnitude. A possible 
approach of molecular electronics might be the reproduction of such biological 
cells and use them as information-processing units. A substantial step would 
be the realization of an artificial membrane. 

Langmuir-Blodgett developed a simple method to realize artificial mem- 
branes (Fig. 11.2): A hydrophobic substance that has been added to water 
aligns as a monolayer on top of the water surface. The aligned molecules at- 
tach to a substrate via adhesion if dipped in water with the monolayer. As the 
substrate is pulled out a second molecular layer attaches to the first molecular 
layer. The second layer has the inverse orientation, which results in a molec- 
ular double layer. This double layer is quite similar to a cell membrane. For 
certain, it is not easy to process this thin cell membrane. 

Ion channels can be integrated into the double layer, just like biological 
cells (Fig. 4.6). Destined molecules can activate these channels and turn them 
into a conducting state. This behavior is equivalent to a controllable switch. 
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Fig. 11.2. Realization of artificial Langmuir-Blodgett films: (a) molecules align with 
their hydrophobic side on top of a water surface, (b) the aligned molecules attach 
to a substrate via adhesion, (c) and (d) by means of a further process step a double 
layer can be realized 
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The next step is to establish a network out of these switches that is suitable 
for information processing. 



11.1.1 Molecular Processor 

A further application of the artificial membrane is the biomolecular processor. 
The objective of molecular electronics is to process nontactile information such 
as light and electrical signals via enzymes. Figure 11.3 shows, for instance, a 
tactile molecular processor. Receptors in the input layer transform nontactile 
information into tactile signals. If the tactile signals match with the enzymes 
that are located in the output layer the specific output is activated. The pro- 
gram (software) of the molecular processor depends on the receptor molecules 
and the read-out enzymes. Just like DNA structures, the molecules have to 
match to each other. This concept can be extended to a teachable and self- 
reproducible processor. It might consist of various series-connected chambers 
that are responsible for distinct tasks. Certain principles of a tactile processor 
can also be found within the synapses of biological networks that have been 
described before. 
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Fig. 11.3. Basic principle of a tacticle molecular processor 



The molecules act as data carriers. They transport information without 
consuming practicaly any energy. Their movement is due to the Brownian 
molecular movement, which anyhow exists because of the thermal energy. This 
gratis information transport is an important issue for very large systems. In 
this connection the following relation of Einstein is of fundamental relevance: 



< r 2 >= a - 1 — 6kT—. 
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According to it the mean square distance r between a particle and its origin 
increases with the time t. 




Fig. 11.4. A micromotor drives a rotor that sorts molecules, such a system might 
be applied within the tactile processor, according to Drexler 



The factor a depends on the Boltzmann constant k, the temperature T, 
and the coefficient of friction /q that is proportional to the diameter of the 
particle. The relation describes the area of influence for a single particle. The 
more time that passes the higher the probability that the particle has ar- 
rived at its destination. Obviously, molecular processors that are based on 
this principle operate quite slowly. In this connection, highly parallel struc- 
tures compensate this drawback, as has been already explained for the DNA 
computer. Another approach tries to overcome this disadvantage by designing 
mechanical molecular structures. Drexler and others design nanomechanical 
computers that are based on the same principles as the first mechanical com- 
puters 100 years ago. A further approach is a rotor that only transports specific 
molecules (Figs. 11.4 and 11.5). According to the shape of the molecules the 
rotor sorts out specific molecules. 

The studies of Drexler and Merkle [54] describe interesting nanomechanical 
approaches. However, from the present point of view it is unlikely that future 
nanoeornputer technology will be based on such structures. Nanomachines like 
the DNA computer seem to be more promising candidates even if they are far 
in the future. 

Probably, a biomolecular computer needs more care from the user than an 
ordinary PC. From this point of view the Tamagoccis that came into fashion 
in 1998 can be considered as a huge field test. It proves that users can really 
be expected to care about their computers. 

11.1.2 DNA Analyzer as Biochip 

An interesting application for the well-being of mankind are analytical chips 
that might identify certain diseases and serve for food inspections since the 
DNA carries the whole genetic information of all living creatures. Within such 
minilabs nanostructures are of fundamental relevance for fluidics. Additionalv 
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nanomechanical structures like nanovalves and nanopumps have to be inte- 
grated (Fig. 11.6). These means make it possible to transport substances to 
their destinations on the chip and replace a microspotter. 

For the analysis itself different methods are feasible. The proteins in ques- 
tion get selected by DNA truncks. Their properties can be discovered by the 
fluorescence behavior of their molecules. In the long term, pure electronic 
methods are preferable to minimize the dimensions of the minilab and to 
avoid complicated optical setups. This case requires a number of electronic 
sensors to be operated in parallel (Fig. 11.7). 

A single sensor electrode within the array consists of an interdigitated 
gold layer. Using a microspotter or a nanofluidic system, single-stranded DNA 
probe molecules are spotted and immobilized at the gold surface. After immo- 
bilization, an analyte containing target molecules to be detected is applied to 
the whole chip and hybridization occurs in the case of matching DNA strands. 
For read-out, a redox-cycling-based electrochemical sensor principle is used: 
After a washing step, a suitable chemical substance is applied and electro- 
chemically redox-active compounds are created by an enzyme label bound 
to the target strands. Applying simultaneously an oxidation and a reduction 
potential to the interdigitated gold electrodes, a redox current between these 
electrodes occurs whose magnitude depends on the amount of double-stranded 
DNA at this sensor position. The biochip provides the results of the tests per- 
formed directly in the form of electronic signals. 
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11.2 Molecular Electronics 

11 . 2.1 Overview 

Molecules are local objects that can take different states. Observable features 
of these states can be used for information storage. Figure 11.8 depicts differ- 
ent possibilities for a binary information storage. A molecule with a number of 
unpaired spins can be aligned in a parallel or antiparallel state to the preferred 
spin condition. These spin conditions can be detected directly by their mag- 
netic interaction or by optical means. Other binary sy terns consist of molecules 
with hydrogen bonds that can be described, in general, as double minimum 
potential BMP (Fig. 11.8b). The proton can take two stable postions A and 
B. The two resulting dipole moments might represent the two logic values 0 
and 1. 

Molecules often have tw r o different configurations, like ffrans' and ‘cis’, as 
is depicted in Fig. 11.8c. A reversible configuration change might also serve as 
a binary memory. Different absorption properties can be used for distinction 
purposes. Even more interesting seems the complex of a donor- acceptor-bridge 
(Fig. 11. 8d) in which an electron is transfered from the donor to the acceptor. 
The donor is in the reduced state before the transfer and afterwards in the 
oxidized state. Both states can be used for a binary representation. 

Figure 11.9 illustrates a fundamental element of molecular electronics. An 
electrical signal controls a bistable molecule via a molecular wire. The semi- 
conductor has very small dimensions and consists of a few 7 molecules so that 
the band theory of solids is no longer applicable. At present, it is very difficult 
to switch molecular devices with electrical signals, also the read-out process 
has not been realized by electrical means. At this early stage of molecular elec- 
tronics switching is still done by light and the read-out process is established, 
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Fig. 11 . 6 . Setup of a microanalytical chip 
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Fig. 11.7. General operational principle of DNA chips 



for example, by spectroscopy. Obviously, complex information processing units 
are not yet feasible. 

Examples of bistable molecules are salicyl- aniline and viologene, whereas 
conducting polymers might serve as molecular wires. In this case information 
transfer depends on a soliton. An acryl plastic layer is put on top of a protein 
layer. Then the acryl plastic is sliced into nanornetric wires by an electron 
beam. This technique will be developed further in the future. 



11.2.2 Switches based on Fullerenes and Nanotubes 

Carbon naturally appears as graphite and diamond. Additionally, it can also 
be found in clusters of 60 carbon atoms. In general, these clusters are termed 
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Fig. 11.8. Different molecular memory structures, according to C. Mehring 
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Fig. 11 . 9 . Basic molecular electronics element: Often light is used as input signal, 
which is not suitable for complexer units 



fullerenes or C60 molecules (Fig. 11.12). Their conductance depends on their 
deformation. The current that can flow through a macromolecule increases 
for higher pressures (Fig. 11.10). Similar to the tunneling effect, the band 
structure is shifted, which effects the conductance. Piezoelectric actuators can 
be applied for deformation purposes. 

The first designs integrated macromolecules into conducting paths. A 
piezoelectric actuator applies a pressure on a single molecule (Fig. 11.11). 
This semi-mechanical transistor can be used as a switch. The active region 
of the composition is limited to the macromolecule, but the entire approach 
suffers from the relatively huge and complex actuator. Therefore, for the time 
being this switch is nothing more than a vision that lacks a VLSI concept. 
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Fig. 11.10. Molecules as controllable resistors or transistors: A piezoelectric actua- 
tor alters the resistance of a C60 molecule that is located between to metal contacts 




Fig. 11.11. The carbon-tube transistor with source, gate, and drain 



It is possible to design and simulate basic circuits that are based on these 
transistors. A spike capacitance of 10“ 15 F and a molecular resistance of 
50 MQ result in a relatively slow inverter behavior with a time constant of 
approximately 100 ns. However, memory densities can reach magnitudes far 
beyond Si-DRAM. 

Nano tubes (Fig. 11.12) offer a similar vision for switches. At present, it 
seems as if this technique might be used for a straightforward implementation 
of nanotransistors. 
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11.2,3 Polymer Electronic 

Polymers become more and more important for electronics in terms of electro- 
luminescent displays and batteries. They are also integrated into transistors 
(Fig. 11.13). Conducting and nonconducting regions can be realized via an ex- 
posure of a PANI layer. The conducting regions can be used to define source, 
drain, and gate contacts. The channel is made of the semi-insulating polymer 
PTV where as PPV suits as the gate insulator. The fabrication of such transis- 
tors is very inexpensive since they can be integrated on flexible plastic sheets. 
Circuits that are based on this technology are interesting for both wearable 
electronics and for a replacement of the well-known bar code. 

Nanoelectronics might offer possibilities to fabricate switches and con- 
nections that are based on polymers (J.C. Ellenbogen). Depending on the 
individual poljuner structure conductors and insulators can be established 
(Fig. 11.14). The idea of polymers for wiring purposes already appeared in 
Chap. 9. 

Table 11.1 compares the main features of polymers, semiconductors, and 
metallic conductors. At first glance, the poor values of the polymer materials 
are quite impressive. If we scale down the values into the nanometric regime 
and assume single-electron transportation the polymer approach catches up 
with the conventional technologies. 

Logic circuits require nonlinear device characteristics. The basic structures 
in Fig. 11.14 lead to the molecular diode in Fig. 11.15. The polymer struc- 
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Fig. 11.12. Besides graphite and diamond carbon also appears as fullerenes and 
nanotubes 
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Fig. 11.13. Polymer transistor for most inexpensive electronics, e.g. wearable elec- 
tronics 
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Fig. 11.14. Molecules as conductors and insulators 



Table 11.1. Comparison between polymers, semiconductors, and metals 
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ture is located between two gold contacts. Gold seems to be very suitable for 
interconnects between different blocks. 

A logic function like an AND gate can be realized with two diodes and 
a resistor (Fig. 11.16). This circuitry is strongly related to diode-transistor 
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Fig. 11 . 15 . Molecular structure of a diode 



logic. In principle it is possible to make up larger chips, however, it is unlikely 
that this concept will succeed. 

Using this concept, even more complex, units are feasible [55]. The discrete 
molecular switches within the logic gate are triggered via polymer wires. The 
direct transformation of conventional logic families into the molecular domain 
appears not to be reasonable, e.g. power that is dissipated during switching 
is more difficult to eliminate. A possible solution is the concept of the Fredkin 
logic that avoids the loss of information within the gate. 



11.2.4 Self- Assembling Circuits 

The three-dimensional molecular structures might be put together like a puz- 
zle. In general, the active molecules suffer from mechanical weakness so that 
a straightforward realization of a three-dimensional system is not feasible. 
Therefore, a substrate has to be fitted to the molecular structure for stability 
and insulating reasons. 

In the following, the basic idea of a technology will be explained that is still 
in its inital state. As an interesting example we will take a closer look at self- 
organizing cells that are located in a zeolite crystal. The objective of the crystal 
is to define the mechanical structure. Some zeolite crystals have a rectangular 
channel structure, in the nanometric regime (Fig. 11.17). It is possible to 
locate semiconductor dots in the cross-points of the channel structure whereas 
polymer wires can be placed into the channels. The semiconductor dots serve 
as memory cells, since they show a bistable behavior. 

This example shows that molecular electronics offers the possibility of a 
three-dimensional integration. Even for two-dimensional layouts the packing 
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Fig. 11 . 16 . Polymer structure of a logic AND gate: By reversing the diodes’ polarity 
and by applying a negative supply voltage the gate turns into a logic OR gate 



density can be increased with this technology by a factor of four. By using 
all three dimensions the capacity will increase tremendously. For instance, a 
rectangular channel structure with a grid of 10 nrn = 100 A and a volume 
of 1 cm 3 has IQ 18 cross-points. In principle, each cross-point can be used for 
information storage, assuming the controllability. 

Obviously, for these fine structures the question arises as to how they can 
be organized. Lithography means are not appliable for three-dimensional sys- 
tems. Self-organization approaches offer a viable structuring concept. One has 
to differentiate between self-organization and self-assembling . Self-assembling 
refers to crystal growth or epitaxy, whereas self-organization can be found, for 
example, in neural networks, which have been explained in Chap. 7. By elec- 
trical means, data has a direct influence on the physical structure that leads to 
a self-organization. According to an external blueprint the required circuitry 
is generated, very similar to biological systems. 

Later, molecular electronics fills the gap between nano- and microstruc- 
tures as an interface. For experimental purposes junctions can be made via the 
scanning tunneling microscope (STM), as described in the following chapter. 
For more complex connections, as appear in dense systems, lithography has 
to be developed further. 
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Fig. 11 . 17 . Three-dimensional zeolite memory: Semiconductor dots are located in 
each cross-point, whereas polymers in the channels serve as wires 



11.2.5 Optical Molecular Memories 

Figure 11.18 reveals a molecular shift register with optical clock distribution. 
Within each period the stored information moves to its neighbor. It is prob- 
lematic to deal with this optical clock distribution when it comes to more 
complex systems. 




Fig. 11.18. Molecular shift register with optical clocking, according to Haarer 



Biomolecules offer a fast and dense memory storage. Primarily they are 
very well suited for three-dimensional architectures and parallel data pro- 
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cessing. For the time being such memories can only be realized as optical 
systems since the reactions within the biomolecules depend on light. Bacteri- 
orhodopsin is an interesting molecule for data storage. This protein can absorb 
light, meanwhile it changes its internal states. According to the color of the 
light bacteriorhodopsin takes distinct states. These different states are easily 
detectable so that bacteriorhodopsin is an appropriate switch and memory 
element. 



red laser 
light 




spontaneous 

transition 



Fig. 11.19. Different states of bacteriorhodopsin after light exposure 
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During light exposure bacteriorhodopsin passes through a photochemical 
cycle, which results in structural changes of the molecule (Fig. 11.19). Green 
light transforms the protein from its inital state bR into K. From the K state 
the protein changes spontaneously into M and then into the O form. Without 
any further influence the protein changes back to its inital state bR. A pre- 
exposure of red light leads the protein to the P state, from where it changes 
rapidly into the very stable Q state. The Q state can only be left via blue- 
light exposure to the inital state. The two states bR and Q are suitable for the 
binary memory representations 0 and 1. In the following example the optical 
memory depends on these two states of the bacteriorhodopsin molecule. 

The intermediate states P and Q result from the sequential absorption of 
a green and a red photon. They are suitable for memory storage. The memory 
storage itself takes place both in parallel and in three dimensions. The media 
has a cubic shape and consists out of bacteriorhodopsin. The cube can be 
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11.20. Optical memory based on the bacteriorhodopsin molecule 



Q-slate 



exposed with batteries of lasers from two different directions (Fig. 11.20). The 
upper laser battery emits green light and starts the photochemical process of 
the bacteriorhodopsin molecules for a selected layer. After a few milliseconds 
the concentration of molecules that are in the O state has increased. Red laser 
light that comes from the left side can now write at distinct places a logic ‘1’. 
These molecules take the P and then the stable Q state, whereas the rest of 
the molecules return to the initial state. Blue light can reset the whole memory 
block. The read operation depends on the selective absorption of red light via 
the intermediate O state. First, the selected layer has to be exposed with green 
light, so that the molecules in the initial state start with the photochemical 
cycle. After tw T o milliseconds the read-laser battery has to emit faint light. 
Molecules in the Q state are transparent for the red light whereas molecules 
in the bR state appear to be opaque for the red light because the previous 
pulse of green light had put them into the O state. 

A detector registers the translucent light and reads the logic states. For 
one layer the read operation takes place in parallel. Therefore, the three- 
dimensional structure results in a short access time, a high memory bandwith, 
and a high storage density. Hopefully, one trillion bits per cm 3 are feasible. 

At present the objective is not to realize a whole computer within this tech- 
nology. First, memory applications that exist of biomolecules will be realized 
and will be connected via a technology-invariant interface to the computer 
system. Initially, single data will be stored in several molecules and in a fur- 
ther step towards even higher storage densities a single bit might be stored in 
one molecule. 
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11.3 Summary 

The suggested approach that tries to copy biological systems leads to inter- 
esting concepts. Up to now these concepts have almost no impact for today’s 
applications. This might change if a complete analytical minilab can be inte- 
grated into a single chip or if huge memories can be provided. 

In terms of molecular electronics carbon structures and polymers are the 
most promising candidates. Applications that are based on huge mass-storage 
systems probably will be rapidly revolutionized by optical molecular memo- 
ries. 
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N anoelectronics with Tunneling Devices 



From today’s point of view tunneling elements (TE) such as the resonant tun- 
neling diode (RTD) are of fundamental relevance for nanoelectronics. They are 
usually of nanometric scale in a single dimension and are promising candidates 
as precursors for future nanoscaled ULSI circuits. At present they are the most 
mature type of all quantum-effect devices . Compared to single-electron tran- 
sistors (SET) and more advanced quantum-dot architectures, resonant tun- 
neling devices are already operating at room temperature. Technological ad- 
vances, such as the development of a III- V large scale integration process, and 
the demonstration of a Si/Sio.sGeo.s/Si resonant interband tunneling diode, 
are a challenge for circuit designers to develop digital logic families and mem- 
ory arrays. 



12.1 Tunneling Element (TE) 

Among all mesoscopic switching elements the tunneling elements (TE) are 
at present of fundamental relevance for nanoelectronics. A tunneling element 
consists of two conducting materials separated by a very thin insulator. By 
means of bandgap engineering we can tune the I-V characteristics of the tun- 
nelling element in such a way that it has a region, with negative differential 
resistance (NDR). The tunneling diode (TD) and the resonant tunneling 
diode (RTD) are the most common tunneling elements. 

The basic idea behind resonant tunneling device circuit design is to ex- 
ploit the nonlinear I-V characteristics with the typical negative differential 
resistance (NDR) region. In this context the isolation of the gate input and 
output signals is of fundamental relevance and has motivated the implemen- 
tation of different three-terminal tunneling devices. The common feature of 
these different devices is the combination of electronic amplification and NDR. 
In addition to the technologically oriented research a further prerequisite for 
nanoscale integration is the investigation of suitable logic families, architec- 
tures and the development of a design framework. 
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12 . 1.1 Tunnel Effect and Tunneling Elements 



Figure 12.1 shows the schematics of two basic tunneling element config- 
urations. A convenient solution is the TE with a vertical layer structure 
(Fig. 12.1a), but technology most commonly uses horizontal layer structures. 
These layers can be precisely deposited and are in the range of 1 to 5nm, 
which roughly is equivalent to the number of S 1 O 2 layers. 




Fig. 12 . 1 . Schematic views of tunneling elements with vertical (a) and horizontal 
(b) oriented barriers 



The tunnel effect refers to particle transport through a potential barrier 
where the total energy of a classical particle is less than the potential energy. 
In the classical sense this particle transport is not possible. This effect can be 
explained if the particle is treated as a material wave. The course of the 
function in the x direction can be described by Schrodinger’s wavefunction. 
As an example Fig. 12.2 shows the solution for a specific case. The solution 
of the wavefunction must satisfy certain boundary conditions at the abrupt 
interfaces of the barrier. This leads to a certain portion of an incident wave 
being transmitted and a certain portion reflected. On the left side of the barrier 
one can observe the entering and the reflected wave. Within the barrier the 
wave is attenuated, whereas on the right side of the barrier the wave of the 
tunneled electron can be found. The wavefunction has to be continuous and 
differential and can be described by four amplitude values and four equations. 
Figure 12.3 depicts the solution of these equations and is equivalent to the 
tunneling probability of the electrons. The tunneling probability increases for 
higher particle energies. 

Due to reflections of the particle wave not all electrons can cross the bar- 
rier, even if their energy is higher than the potential energy of the barrier 
(Fig. 12.3). The electron wave can only cross the barrier without dissipation 
if the energy of the electrons is six times higher than the potential energy of 
the barrier. 

For the lower part of the function in Fig. 12.3 there is an approximation: 
The amplitudes of the W function on both sides of the barrier are proportional 
to the probability of presence of the particles. The ratio of these amplitudes 
approximates to the tunneling probability D: 
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Fig. 12.2. Tunneling effect: The material wave is attenuated within the barrier 
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A particle that has energy E can tunnel with probability D through a 
potential barrier that has height Wo and width d. The aperture probability D 
is higher for lower and thinner potential barriers. C is a constant proportional 
factor, h is Planck’s constant and in represents the electron mass. The portion 
of tunneled particles as well as the tunnel current can be gathered from the 
tunneling probability. This effect is exploited among other things within the 
tunneling element (TE) which can be approximately modelled as an ohmic 
resistor. 

An electric field distorts the barrier shape, as denoted in Fig. 12.4. The up- 
per part of the barrier gets smaller so that the tunneling probability increases. 




Fig. 12.3. Tunneling probability as a function of the normalized energy, according 
to the Schrodinger equation 
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Fig. 12.4. Example for the tunneling process: a) tunneling barrier in its ground 
state, b) tunneling element with applied voltage. 



Once the electric field is removed the barrier shape returns to its inital state 
and the tunneling probability decreases. This effect is used, for instance, in 
electrically erasable programmable read-only memories (EEPROM). 

Tunneling elements are very interesting switching elements for nanoelec- 
tronic applications because the electron transport takes place without any loss 
of energy. Additionally, the switching speed of the elements is very high, since 
the potential barriers are very thin. Physicists discuss whether the transport 
mechanism in the TE is faster than the speed of light. To what extent this 
assumption might be correct and what impact it might have on nanoelectron- 
ics remains, for the time being, an open question. A TE can also be a source 
of errors: If we think about a MOS transistor with a very thin gate oxide the 
tunneling current through the gate oxide has to be taken into account, the 
same effect can also occur between conductors that are separated bv a very 
thin insulator. 



12.1.2 Tunneling Diode (TD) 

The interesting feature of a tunneling diode is its negative differential resis- 
tance. It consists of two semiconductors that are separted from each other by 
a thin insulator. Figure 12.5a illustrates the potential barrier caused by an 
insulator. Electrons can tunnel through this barrier only if it is thin. A fur- 
ther requirement for the tunneling process is the existence of an unoccupied 
band on the other side of the barrier. The unoccupied band can pick up the 
tunneling electrons (Fig. 12.5b). The barrier thickness can be altered by an 
electric field. The tunneling process can still be impossible, even if the barrier 
thickness is very thin because of the absense of a free band at the other side 
of the barrier. For higher electric fields the influence of the barrier can be 
neglected and the common diode effect can be observed. The complete I-V 
characteristics of the tunneling diode with its negative differential resistance 
region is depicted in Fig. 12.6. 
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Fig. 12.5. Tunneling effect in semiconductors: (a) Within material that shows a 
band structure tunneling is only possible if a free band is available on the other side 
of the barrier, (b) an electric field can displace the band structures so that tunneling 
becomes impossible 



The shape of the I-V characteristics reveals the underlying physical ef- 
fect: One can observe a falling slope in the characteristics, which in general is 
referred to as the negative differential resistance (NDR). The essential param- 
eters of the TD are the peak current Ip, the valley current Iy , the peak voltage 
Vp, and the valley voltage Vy. These parameters can be derived from the band 
diagram. The ratio of Ip to Iy is important for circuit designers because of 
its direct influence on the signal amplitude. In this context the minimization 
of the valley current is a challenging issue. In the quiescent state of the gate 
this current has still to be delivered. Therefore it seems that this technique is 
only suitable for low-power applications if the valley current can be reduced 
to approximately zero. For very fast circuits the valley currents can almost be 
ignored, since the (dis-) charging of parasitic capacitances is the main cause 
for power dissipation. 

For many years the realization of the thin insulator has been a crucial issue 
and has been solved by a technological trick: Very high doping levels result 
in a degradation of the semiconductor in the sense that its band structure 
overlaps. The Esaki diode consists of a pn-j unction with extremely high doped 
semiconductor regions (Fig. 12.7b). The upper part of the valence band that 
is located in the p-type region overlaps with the lower part of the conduction 
band that is situated in the n-type region. Both regions are separated by 
a very thin space-charge region that should prevent (in the classical sense) 
any current for small voltages. Quantum mechanics explains the tunneling of 
electrons through such a thin space-charge region. Therefore one can observe 
a high current through this region even for small voltages (Fig. 12.7b). This 
current reaches a maximum value for a specific voltage. For higher voltages 
the current starts to drop because the overlap of the above-mentioned bands 
shrinks. For even higher voltages the regular diode current starts to contribute 
to the overall current, which therefore increases for a second time. Thanks to 
this technological trick it is possible to obtain an I-V characteristics with a 
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negative differential resistance region. In a first-order approximation the I-V 
characteristic can be described by the equation of van der Ziel: 

I = A" DIVlT) + Ig0 (ew - l) , V v >V>0. (12.2) 

A” is a constant, Vy symbolizes the valley voltage, and Iso stands for 
reverse saturation current. 

Among all tunneling elements the tunneling diode is the most popular one 
and was realized for the first time in 1968. It has been implemented in silicon 
and did not have any important impact on microelectronics. Only when new 
integration forms like the resonant tunneling diode (RTD) w T ere developed did 
the tunneling diode start to become successful. The resonant tunneling diode 
has several degrees of freedom, so that the I-V characteristics can be adjusted 
in a wide range. 



12.1.3 Resonant Tunneling Diode (RTD) 

The typical negative differential resistance of the tunneling diode is very no- 
ticeable for the resonant tunneling diode (Fig. 12.8). Within the RTD the 
source and drain contacts are seperated from the channel region W by tun- 
neling elements. The channel region W can be described as the potential well. 

Its behavior can be derived from the Esaki tunneling diode. The band 
structure of the channel has to be approximately on. the same level as the 
band structure of the source contact. Otherwise only a small current flows 
through the device. 
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Fig. 12.7. Tunneling diode as a solid-state switch: I-V characteristics with NDR 
(a) and its band diagram (b) 




Fig. 12.8. Cross section of the RTD: (a) with vertical and (b) with horizontal 
barriers 



A. serial combination with an ohmic resistance shows three operating points 
(Fig. 12.9): Ai and As are stable, whereas A 2 is a metastable operating point. 
The stable operating points can be used to store data, whereas the metastable 
one is intersecting for very sensitive applications such as amplification. For 
example, a faint external signal may delegate the gate to one or the other 
stable operating points. The switching takes place in a very fast way because 
of the thin barriers. 

The term ‘resonant’ refers to the behavior of the de Broglie wave in the 
quantum well. It is located between the two barriers and emerges if the en- 
ergy of the tunneling electrons is equivalent to the level in the quantum well 
(Fig. 12.10). Under these conditions the transmission T also reaches its highest 
peak. The width of the transmission chraracteristics results from the interac- 
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Fig. 12.9. Resonant tunneling diode: (a 
diagram for Vdd > 0, (c) I-V charactisti 
wuth load resistance Rl 



WA 




d) 



band diagram for Vdd = 0, (b) band 
; with three operating points, (d) RTD 
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Fig. 12.10. Resonant effect: The material wave reaches its maximum if the energy 
of the tunneling electrons is equivalent to the level in the quantum well 
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tion of the electrons with the crystal lattice. The electrons can emit or receive 
energy from the crystal lattice so that the energy condition still holds if the 
energy deviates slightly from the level in the quantum well. 




A different implementation form of the resonant tunneling diode uses three 
barriers (Fig. 12.11). In this case the typical I-V characteristics depends on 
the displacement of the two quantum wells. In contrast to the previous RTD 
type this implementation does not need any specific band structure at the 
drain and source contacts. 

Such a device can be made out of thin metal layers with even thiner insu- 
lator layers (1 ----- 2 nm) that act as quantum wells. Depending on the quantum 
well different configurations of energy levels are attainable (Fig. 10.4). 

In general, the tunneling diode has very good switching properties, but 
suffers from the drawback of being a two-terminal device. The isolation of the 
gate input and output is of fundamental relevance and cannot be achieved by 
the tunneling diode on its own. This has motivated the implementation of dif- 
ferent three-terminal tunneling devices. The common feature of these different 
devices is the combination of electronic amplification and of the effect of NDR. 
The monolithic integration of the tunneling diode with three-terminal devices 
is a prerequisite for manufacuturing such circuits. These devices, namely field 
effect transistor and the bipolar transistor offer the required input-output 
isolation and gain. 
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12.1,4 Three-Terminal Resonant Tunneling Devices 

The switching behavior of two-terminal devices like the tunneling diode can 
not be controlled via a third terminal. Without this essential feature no isola- 
tion between the gate’s input and output is feasible. Nevertheless, this isolation 
is of fundamental relevance when it comes to circuit design. 



resonant-tunnelling barrier 




Fig. 12.12. Band diagram of the RTBT 



The monolithic integration of an RTD structure into the emitter branch of 
a bipolar transistor is called a ‘resonant tunneling bipolar transistor’ (RTBT) 
which offers a solution for this scenario. In this case, the regular pn-j unction 
between the two terminals base and emitter is replaced by an RTD structure 
(Fig. 12.12). The I-V characteristics (Fig. 12.13a) reveal the amplified collector 
current with the typical NDR behavior. 

Another solution for the isolation problem is the combination of the field 
effect transistor (FET) with an RTD. Serial and parallel combinations of these 
two devices show the negative differential resistance (Fig. 12.13b). In both 
cases the I-V characteristics can be altered by the gate voltage of the FET. 

Figure 12.13c shows the schematic of a single-electron transistor (SET) 
that consists of small capacitances and two tunneling elements. Again we can 
observe NDR regions in its I-V characteristics that are typical of all quantum- 
effect devices. A closer look at the SET will be taken in the next chapter. 



12.2 Technology of RTD 

According to the schematic views of an RTD one would expect two layers of 
semiconductor material that form the quantum wells and one additional layer 
for the quantum well. Actually, an implented RTD shows this structure, but in 
a more complex surrounding. To achieve a stable assembly of these thin layers 
a relatively complex layer structure is necessary. The example in Fig. 12.16 
shows a typical RTD structure within the indium gallium arsenide technology. 
One can anticipate the challenging implementation of such a device. 

Figure 12.14 shows different implementation forms of the RTD. Its combi- 
nation with a FET can be realized in series (a, b, e) as well as in parallel (d, 




12.2 Technology of RTD 197 



The main objective of the applied research in this field is to implement 
such RTDs in a silicon or silicon-germanium technology and to transform the 
already attained knowledge into the mainstream technology. The experiences 
of the silicon technology have been gathered over many years and it may be 
feasible to exploit these experiences onwards. Later, RTDs could be easily co- 
integrated with, conventional CMOS technology. This step would accelerate the 
introduction of the RTD technology and further nanoelectronic components. 

In Fig. 12.14a and e the tunneling diode consists of an Esaki diode. The 
diode is composed of an n + -doped contact and a p + -doped diffusion region. 

Figure 12.15 depicts the micrograph and the cross section of an RTD-FET 
combination that has been realized in an indium phosphide technology. The 
micrograph reveals the present drawbacks of this approach: The RTD can be 
perfectly implemented as a mesoscopic device, whereas the remaining compo- 
nents such as wiring and contacts are still too large for any nanoelectronic 
application. Thus, negative aspects are the low packing density as well as the 
large parasitic capacitances that increase gate delay times. In this connection 
the question arises if this approach is viable under these conditions. 

This problem might be solved if it is possible to introduce RTD into the 
polymer technology. In Fig. 12.17 an RTD is composed of insulating and con- 
ducting molecules that form the tunneling elements. But nevertheless, the 
question of how to build complex circuits and architectures out of these me- 
chanically unstable elements remains open. 



a) Bipolar Quantum Resonant 
Tunneling Transistor 




b) Gated Resonant 
Tunneling Diode 




c) Single Electron 
Transistor 





Fig. 12.13. Basic configurations of RTBT, FET-RTD, and SET 
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Fig. 12.14. Some FET- RTD combinations 



12.3 Digital Circuit Design Based on RTDs 

12.3.1 Memory Applications 

There are already many proposals for memory applications and logic circuits 
that make use of resonant tunneling diodes. Figure 12.18 illustrates a static 
memory cell that is composed of serially connected RTDs and a FET. 

The FET selects the RTD combination for read and write access. During 
the write operation the signal on the bit line drives the memory node to 
the desired state. As we will see in the following, the switching operation is 
faeiiiated if the supply voltage is clocked. 



12.3.2 Basic Logic Circuits 

In the style of the classic CMOS circuitry RTD-based logic gates can be com- 
posed: Figure 12.19a shows an inverter. The tunneling diode provokes a hys- 
teresis during the switching operation. This is not a parasitic side effect, but 
increases the noise margine so that the circuit shows noise immunity up to a 
certain extent. 

Figure 12.19b and c display an OR gate. The Boolean operation is estab- 
lished with FETs in the well-known manner. In combination with the RTD 
the I-V characteristics of the OR gate are similar to that of the inverter. The 
RTD offers the advantage of high gain at the switching point, because of the 
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Fig. 12.16. Layer structure of an RTD, courtesy of F.-J. Tegude 



nve differential resistance region. Therefore the switching itself is 



,3 Dynamic Logic Gates 

ig even more advantage of the NDR region leads to promising com 
od example in this regard is the monostable bistable logic transitu 
\t (MOBILE) [56]. Its plain configuration can be found in Fig. 12d 
t years high-speed logic families based on the MOBILE have been 
1 for tunneling; devices. The MOBILE g;ate is a useudo dvnamic. cl< 
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Fig. 12.18. Static memory cell based on RTDs 



logic circuit and consists of two resonant tunneling diodes that operate in a 
monostable or bistable state depending on a clocked power-supply voltage. 
The term pseudo dynamic refers to the special circuit style in the sense that 
a clock controls the logic transition of the gate, similar to dynamic circuits. 
But in contrast to dynamic circuits, where the logic state is represented by 
the electrical charge on a capacitor, MOBILE circuits are in a static, self- 
stabilizing state due to the inherent bistability of their devices. Consequently, 
they are more robust against charge leakage, and precharging is unnecessary. 
The functionality of MOBILE-type gates is specified by embedding a logic 
input stage into the self-latching RTD pair [57]. This logic input stage can 
be used to implement a variety of different logic schemes, such as conven- 
tional boolean logic, threshold logic, and multivalued logic [58]. Depending on 
the specific device technology the input stage is composed of HFETs, surface 
tunneling transistors, RTD-Schottky diode combinations, or RTBTs. 

A significant feature of the RTD-HFET is that the peak current is inde- 
pendent of the gate voltage for Vqs > 0-2 V. Thus, if a logic high voltage 
is applied to the gate of the RTD-HFET there is no difference between an 
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RTD-HFET and the two RTDs as it is shown by the equidistant steps of the 
switching-current difference in Fig. 12.20b. 

The logic function of a MOBILE is related to the peak-current difference 
of load and driver RTD. If the MOBILE input is provided to the gate-source 
voltage Vqs of an HFET this current difference AI is determined by the trans- 
fer characteristic AI = Id = /(Vos)- Therefore, the logic function becomes 
dependent on a precise control of the HFET threshold voltage and transcon- 
ductance. At a sufficiently high input voltage (gate bias Vqs > 0-2 V) the 
drain current is limited by the RTD peak current Ip and is independent of 
the input voltage. Hence, the transistor is limited to a switching functional- 
ity and the precise addition of a current AI to the MOBILE is adjusted by 
the RTD area {AI — Ip — a • Artd ) made from the same layer sequence 
and with the technology as load and driver RTD (a — const.). This concept 
enables the manufacture of complex gates with several inputs. Especially the 
design of multiple-input threshold gates with constant weights determined by 
the input RTD area only becomes feasible [59]. In addition, the decrease of 
the drain current at high drain bias results in a lower power consumption in 
the on-state and a higher driving capability. 

To extend the computational capabilities of RTD logic gates the current- 
controlled switching of the MOBILE in connection with the RTT input stage 
allows a functional implementation of linear threshold gates (LTGs). The char- 
acteristic feature of LTGs is the parallel processing of multiple inputs. A LTG 
calculates the weighted sum % of the digital inputs Xk,k = l,...,A r . The 
weighted sum is converted into a digital output y by comparing % with a 
given threshold value 0 (Fig. 12.21). If the weights Wk and the threshold 
value 0 are selected properly the LTG computes any linear separable Boolean 
function of the N inputs. The output y of a threshold gate is given by 

f 1 if X > 0 

V(X) = sign (X - 9) = < (12-3) 

I 0 ii x < 0 

N 

X = Yh Wk ' Xk > Xk = {o> i} 

k = 1 

Wk = {0, ±1, . . . , ±w max } , 

0 = {0, ±1, . . . , ±0 max } . 

Compared to a Boolean logic gate, a threshold gate combines an internal 
multivalued computation of the weighted sum with digitally encoded input 
and output states. Actually, this capability of processing multiple input signals 
enables it to design circuits with bit-level parallelism and reduced complexity. 
Thus, our approach shows a certain similarity to multivalued logic, but differs 
in regard to the digital input and output logic states. Although not explicitly 
mentioned the programmable NAND/NOR gate of the previous subsections is 
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Fig. 12.21. Linear threshold gate y = sign(xi + X 2 — xs — xa — 0) and MOBILE 
circuit 



basically a LTG with two negative weighted inputs and a modifiable threshold 
value. 

The circuit in Fig. 12.21 shows an RTD-based threshold gate composed of 
two serially connected RTDs and four RTTs with two positive- weighted inputs 
(xi,X 2 ) and two negative- weighted inputs (# 3 , £ 4 ). The threshold value is 
implemented by modifying the RTF) anode area of the driver RTD. According 
to the current-controlled switching the weighted sum \ of a LTG is then given 
by the total input current at the output node according to KirchhofFs current 
law. If the threshold gate has M p positive- weighted and M n negative- weighted 
inputs, the total input current at the monostable-bistable transition point is 

m p M n 

AI — YwklpjVGSk) - Y, w kIp(y GSk ) • (12.4) 

k—l k—l 

Here, Ip (Vest) is the peak current of an RTD with minimum RTD area A m { n 
(weight w m i n = 1). The weight factors Wk express the area ratios between the 
RTDs with weighted inputs and the RTD with minimum area: A w = w-A m i n . 
Thus, according to the design methodology of the NAND/NOR gate, the 
weight factors Wk are equivalent to the RTD design parameters: \wk\ = A &. 
The inputs Xk of the threshold gate are set by the gate source voltage Vosk 
and load-line diagrams similar to Fig. 12.20 can be derived. In the proposed 
circuit the metastable transition is an efficient way to implement the required 
comparison function because the sign of the input current is equivalent to the 
sign of the weighted sum. Since the threshold value 0 is the peak current 
difference of the driver and load RTD Iq — O Ip the internal weighted sum 
(i.e. the input current) is finally converted into a digital output: 
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Fig. 12.22. Load-line diagram of the basic RTBT gate 



f V H for AI > I e 

Vout = { (12.5) 

[ V L for AI < I G . 

Figure 12.21 shows a universal RTD gate. The inputs determine the cur- 
rent that flows through the input RTD. When this current is higher than 
the threshold current I t h of the MOBILE gate the gate switches to the logic 
high-level. The logic function of the gate can be defined by the tuning of 
the individual input currents. Although this gate appears to be a very el- 
egant and universal implementation, it suffers from the disadvantage to be 
nonimmunized to signal fluctuations. In terms of digital VLSI design the im- 
munity against any amplitude characteristics is of fundamental importance. 
This claim seems to hold even when it comes to nanoelectronics. 



12.4 Digital Circuit Design Based on the RTBT 
12.4.1 RTBT MOBILE 

The RTBT MOBILE is based on the same principle as the HFET-RTD MO- 
BILE introduced in the previous section. Two serially connected resonant tun- 
neling structures offer a monostable-bistable element that can be controlled 
by a single clock signal [60]. 

Figure 12.22 shows the monostable-bistable transition of an RTBT input 
stage. The areas of the RTDs are chosen so that the peak current of the load 
RTDi is larger than the current of RTBT c i c k , but smaller than the current 
sum of RTBTcick and RTBT a . From this it follows that the gate switches to 
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the logic low voltage if the input RTBT a is off. If RTBT a is on, the current 
sum of RTBTdck and RTBT a exceeds the current of the load RTD i and the 
gate switches to the logic high voltage. 

The threshold is determined by the peak current of RTD\ while the input 
weight is represented by the peak current of RTBT a . An input weight of w ~ 1, 
for example, is characterized by the minimum cross section of the resonant 
tunneling structure in the emitter branch, of the HBT. This scales the peak 
current to its minimum and is symbolized by A = 1. The gate switches from 
the monostable state into the bistable state if the clock voltage V c i c k drives 
RTD { over its peak voltage. The bistable states represent the two logic levels 
0 (low) and 1 (high) . At the switching point the gate follows the upper or lower 
gate characteristics depending on the input voltage V a . The peak voltage Vp 
and the clock voltage V c i c k is 0.18 V, and —0.1 V respectively. This leads to 
a voltage swing of AV — 0.4 V . During the active clock phase the voltage 
drop V c i c k ~ Vbe ~ V ee — 0.53 V across the two resonant tunneling devices lies 
between 2V P — 0.36 V and SV P — 0.54 V and drives the MOBILE from the 
monostable into the bistable state. 

Comparing the RTBT MOBILE with the HFET MOBILE concept of Chen 
and its extension by Pacha it uses the monolithically grown RTBT input stage 
for both the signal input and the clock input. As the MOBILE is a clocked and 
current-controlled gate with integrated latch function it is well suited to the 
current-controlled RTBT. Furthermore, the lack of a reliable enhancement- 
type HFET is not an issue for the RTBT because of its built-in potential that 
depends on the base and emitter bandgap. 



12.4.2 RTBT Threshold Gate 

A linear threshold gate calculates the weighted sum % of the digital inputs 
Xk-, k — 1, .., N. By comparing x with a given threshold value 0, the weighted 
sum is converted into a digital output (Fig. 12.21). 

Figure 12.23 shows the schematic view and the micrograph of the RTBT 
threshold stage. The weight of each input is determined by the peak current of 
the corresponding resonant tunneling diode (RTD). Since the heterojunction 
bipolar transistor (HBT) is a current-driven device we use an emitter-follower 
configuration. This keeps the input impedance high and the transistor does 
not go into saturation. To realize negative weights it should be reasonable 
to add RTBT input stages into the pull-down network (PDN), but this is 
problematic for two reasons: First, the RTBTs in the PDN need a dc offset 
of at least Vbe(ou ) f° maintain 1-0 compatibility. This additional logic level 
would require a level shifter. Secondly, the RTBTs of the extra input stages 
tend to enter saturation which should be omitted for performance reasons. 
Instead we use a modified differential output buffer. As we will see later, the 
output buffer preserves the threshold gate from the drawbacks of the input 
stage in the PDN, while it enables negative weights. 
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Fig. 12.23. Schematic view and chipfoto of the RTBT threshold gate, W. Frost, 
F.J. Tegude. 



12.4.3 RTBT Multiplexer 

The steady movement towards broadband integrated services digital network 
implies an expansion of transmission capacity to a level beyond multi-terabit 
per second. High-speed time-division multiplexers are key components in 
broadband communication systems. In this connection RTBT based MOBILE 
circuits help to reduce the component counts per logic function. An increase 
of the logic functionality of the single device keeps the logic depth low and 
reduces its complexity. Figure 12.24 shows the schematic view of the suggested 
2:1 RTBT multiplexer IC. Each input stage is based on the RTBT MOBILE 
introduced before in Fig. 12.25 and behaves as a logic AND. The two inputs 
data a and data 5 are used as signal inputs whereas the inputs clck and clck 
select the individual channel of the multiplexer. Applying a logic low level 
to one of the clock inputs will lock the individual input stage in a logic low 
level, meanwhile the output of the other inverter is ruled by its data input. 




Fig. 12.24. Schematic view of the RTBT multiplexer 
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Fig. 12.25. Load-line diagram of a single input channel 



The output of the MUX is established by a modified ECL buffer. The two 
parallel transistors HBT a and HBT^ can share the same branch of the ECL 
buffer, since they are never activated simultaneously. Considering the design 
methodology the RTBT-MUX consists of simple input stages being used as 
basic building blocks. Hence, a regular design structure is maintained and the 
RTBT-MUX circuit is characterized by a low output impedance and a high 
input impedance. 

Multiplexers for the next generation of digital receivers, ATM switches, 
and buffers depend on such highly integrated functional blocks. 
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Fig. 12.26. Micrograph of the RTBT multiplexer (A « 125 x 130 fim 2 ), F.J. Tegude 
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The micrograph of the RTBT multiplexer can be seen in Fig. 12.26. The 
symmetrical format as well as the separation of the input and output stage 
can also be traced in the implementation of the design. Due to the vertical 
structure of the monolithically grown RTBT it can not be distinguished in 
the top view from the HBT. 



12.5 Summary- 

Tunneling devices are controlled by the applied voltage. There are two effects: 
First, the electric field distorts the potential barrier so it becomes thinner and 
the tunneling current larger, secondly, the potential drop moves the allowed 
bands so the current changes or stops. The combination with a transistor 
overcomes the disadvantage of the only two-terminal device. 

From the present state of technology, circuits with tunneling elements are 
the first step into nanoelectronics. At the moment the special technologies do 
not offer the chance to realize lateral nanostructures since this technology is 
not so well developed as the silicon technology because of economic reasons. 
Therefore the main applications of tunneling elements are fast logic circuitries 
and fast memories. 
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Single-Electron Transistor (SET) 



One of the most exciting challenges of microelectronics is, of course, the vision 
of a switching device that can control single electrons. The idea of the single- 
electron transistor is based on the so-called Coulomb blockade . This effect only 
appears within very small device dimensions and is due to the quantization 
of charge [61]. 



13.1 Principle of the Single-Electron Transistor 

13 . 1.1 The Coulomb Blockade 

To explain the Coulomb blockade we take a closer look in Fig. 13.1 at the 
composition of a tunneling element and a capacitor. The capacitor is located 
on the right side of the tunneling element and has a very small capacitance 
due to its very small dimensions. The energy level of electrons within the 
capacitor can be adjusted by an external gate voltage Vq. 

Electrons can tunnel from the right side of the barrier to its left side as long 
as the energy level on the right side remains lower than on the other side. If this 
is not the case, the electrons get blocked by the Coulomb blockade, since the 
charge of a single electron would cause an increase of the energy level. This 
remarkable increase is due to the very small capacitance of the capacitor. 
Electrons can only tunnel if the energy balance is positive after tunneling. 
This effect can be explained within the boundaries of classical physics by 
Coulomb’s law, however, it assumes the quantization of the electrical charge. 
The tunneling element is needed for a galvanic isolation, but it nevertheless 
offers a voltage-controlled tunneling probability. 

The following discourse derives the most important equations of the single- 
electron transistor. The total energy E depends on the amount of charge on 
the capacitor and on the electricical potential V. The energy of a capacitor C 
that is charged with n electrons is equivalent to: 
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x 



Fig. 13.1. Schematic cross section of the composition Tunneling element and ca- 
pacitor’ as well as its energy diagram, case A: the electron cannot tunnel since its 
potential energy would be higher afterwards, case B: the electron tunnels because the 
electron’s energy balance is positive after tunneling, however a further electron gets 
blocked again by the Coulomb blockade. The energy shift from A to B is controlled 
by the gate voltage Vg (13.3) 




0 1 2 3 VC 

q 

Fig. 13.2. Equivalent circuit diagram of the composition Tunneling element and 
the capacitor’ with Q — VC as well as their Q-V characteristics for the normalized 
voltage 




The energy difference on the capacitor C for the two charge situations of 
n and n+1 electrons corresponds to: 



E n +i — E n — J~^(2n + !)• (13.2) 

An electron can only pass through the barrier via tunneling if this energy 
difference is less than orequal to the energy of an electron. The energy of an 
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electron can be expressed as: 

E = qV. (13.3) 

This implies that the voltage V should not exceed the following value: 

q(2n + 1) 



V 



2C 



n = 0,1, 2, 3,... . 



(13.4) 



For the normalized voltage the location of the steps is equivalent to: 



VC 1 

— = -(2n+l). (13.5) 

q z 

According to n = ® = 0, 1, 2, 3, ... Fig. 13.2 reveals the stair function: Each 
step originates from the tunneling of a further electron onto the capacitor. If 
the capacitor is scaled down towards a quantum dot, distinct energy levels 
arise due to the de Broglie waves of the electron. This effect of quantum 
confinement goes beyond the Coulomb blockade. 



13.1.2 Performance of the Single-Electron Transistor 

The SET consists out of two tunneling barriers and a small island (Fig. 13.3). 
The gate voltage controls, via the Coulomb blockade, the current that flows 
from the source to the drain contact. In contrast to devices such as RTDs 
and FETs the switching behavior of the SET depends on single elementary 
charges. For an explanation we take a closer look at the equivalent circuit 
diagram in Fig. 13.4. 




source drain 

Qi 

q 

n+l- 



n-2 1 

I ^ 

b) x 

Fig. 13.3. Setup of the SET and its corresponding energy diagram 



Starting from the above-derived relation we can state the voltage condition 
that has to be met for tunneling if Vqs = 0 : 
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GQVgs 




Fig. 13.4. Equivalent circuit diagram, the island P is located in between the tun- 
neling elements 



Vp T = 



Q 

2 (C + C G ) ' 



( 13 . 6 ) 



Since the island owns has as much voltage as the drain contact the current 
Ids starts to flow at the voltage: 



Vds 



Q 

C + Cq' 



( 13 . 7 ) 



Figure 13.5 refers to this threshold voltage as case 1. The corresponding 
characteristic is symmetrical about the zero point. The current vanishes in. the 
so-called Coulomb gap. The voltage Vps between the island P and the source 
contact S depends on Vg- 



y ps = ——— v G . ( 13 . 8 ) 

However, a current can only flow if this voltage corresponds to the above- 
derived threshold voltage. Under these circumstances, it follows from these 
two equations: 



V G 



Q 

2 Cg 



( 13 . 9 ) 



Figure 13.5 refers to this as case 2. For this gate voltage the Coulomb gap 
disappears. Figure 13.5 also depicts the threshold voltage Vp as a function of 
the gate voltage Vo- The interplay of charge packets results in a symmetrical 
alignment of periodic and even functions. 

However, for the complete description of the overall transistor performance 
there are more parameters to be considered: 



1. Charge that is located in the surroundings of the transistor has an im- 
pact on its threshold voltage and may cause an uncontrolled drift of the 
transistor’s threshold voltage. At present this is considered to be a serious 
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Fig. 13.5. Characteristics of the drain current versus the drain voltage and of the 
threshold voltage characteristics versus the gate voltage for a SET 



problem, since the background charge is, in many cases, time dependent. 
It appears as some kind of noise. A background charge of Qo results in a 
threshold voltage drift of: 



AV = 



Qo 

c + c G : 



(13.10) 



2. As a further restriction the thermal energy has to be considerable lower 
than the electrostatic energy Ec- 



E c = ~j » kT. (13.11) 

This constraint can already be met with moderate-scaled devices if the 
circuit is cooled. If the capacitances are scaled down in a more aggressive 
manner the energy steps will increase in such a way that the circuit can 
be operated at higher temperatures. Consequently, if the capacitance of 
the island can be scaled down to a capacity of roughly 10“ 18 F roomtern- 
perature operation will become feasible. 

3. The switching behavior depends on the time constant of the tunneling 
process and is dominated by the tunneling resistance and the overall ca- 
pacitance: 



t t = R t C. (13.12) 

As a further constraint the electrons should cross the barrier because of the 
tunneling process and should not interfere with the uncertainty principle: 

E c t t » h. (13.13) 

The energy E c has to be considerable higher than the thermal energy as 
explained in Chap. 4: 

E c = hov 2 » kT. 

Therefore, the tunneling resistance is restricted to: 



(13.14) 
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Rt » 2-^. (13.15) 

q 

This quantity is called the von-Klitzing resistance and evaluates to: 

R K = \ = 25.8 kQ. (13.16) 

T 

Obviously, SET circuits show a higher impedance than conventional 
CMOS circuits. This is not a critical issue since the accompanying ca- 
pacitances are relatively small. 

The above mentioned time constant can be evaluated at T — 6 K: 

tt > jh ~ 10” 11 S. (13.17) 

For a capacitance of C = 2 x 10~ 16 F the time constant evaluates to: 

T T = 2 R k C = 2 x lir 11 s. (13.18) 

Both cases show almost the same time constant. For application pur- 
poses it will be about 1 ns, which is quite small for such a high transistor 
impedance. 

The presented model of SET is an idealistic one, real circuit simulation needs 
a more sophisticated model [62]. 

13.1.3 Technology 

At first glance it seems to be difficult to fabricate SETs consisting of two 
tunneling elements and a small island. The following three concepts will reveal 
the feasibility of the SET technology. The technological challenge is to find 
the appropriate way to bring one of these concepts into mass production [63] . 

The first concepts uses a batch process to coat small metal balls with an 
insulator. Such a ball can be placed on top of an interrupted conductor. The 
gate might be integrated into the substrate (Fig. 13.6a). The objective of such 
concepts is to achieve some kind of self-organization on the level of atoms or 
molecular clusters. 

Another concept realizes the appropriate tunneling elements via different 
deposition angles (Fig. 13.6b). First, A1 is deposited from the right side so that 
its oxide can directly be used as a tunneling barrier. In a second process step 
A1 is applied from the left side as depicted in the schematic view of Fig. 13.6b. 
The two applied A1 layers overlap and form the tunneling element. 

In silicon the SOI technique can also be used to implement SETs. As 
Fig. 13.7 reveals the essential elements such as tunneling barriers and islands 
can be realized within a very thin Si layer. The use of the substrate as gate 
electrode is feasible, but is not suitable for fast circuit operations. This con- 
cept might be improved by an additional silicon layer that includes the gate 
electrode. 
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Fig. 13.6. Different concepts for SET fabrication: (a) metal balls that are coated 
by an insulator, (b) different deposition angles result in two tunneling elements and 
an island 




Fig. 13.7. SOI technique in silicon: The two thin bridges connect the island to its 
surroundings, the substrate forms the gate electrode, which is not too convenient for 
circuit design 
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This example shows distinctly that it is very important to find an appro- 
priate structure for the SET implementation. The contacts have to have small 
parasitic capacitance, meanwhile the tunneling contacts have to be reliable. 



13.2 SET Circuit Design 

The following examples explain the principles of SET circuit design. How- 
ever, their practical operation is still a long way off. The SET circuit design 
fashion is strongly motivated by typical CMOS design techniques as well as 
QCA design techniques. From the present point of view SET circuits are very 
promising candidates for memory applications, artifical neural networks, and 
low-power applications [64]. 



13.2.1 Wiring and Drivers 

The charge of a single electron can not drive a metallic wire. Consequently, 
the charging of a long wire via a SET would take a long time. A possible 
solution might be a shift register that acts as a wire. Within such a shift 
register the data is represented by single electrons that are shifted from one 
node to the other (Fig. 13.8). For such a configuration the (dis)charging of a 
long wire is not a issue. However, the shift register needs a clock supply for a 
fast data exchange, which consumes electrical energy. A similar concept has 
been presented for the QCA circuit design in Chap. 10. 

Two clocked SETs can be operated as a current driver (Fig. 13.9). They 
act as an electron pump: If the gate voltage is altered with frequency / within 
each clock cycle one electron can pass from the left electrode via the two 
tunneling elements to the right side. Therefore, the current I is proportional 
to the frequency f: 



I = qf. (13.19) 

An atomic clock might control the frequency f so that a very precise control 
of the current is feasible. This is a very interesting issue for measurement 
standards. 

For circuit designers the driving capabilities are more interesting: The 
characteristic impedances of the circuits are relatively high, however, the ca- 
pacitances are small and the pumped current increases with the frequency. 
Assuming resistances of 100 kf2 and capacitances of 10 -16 F the switching 
time is about 10 ps. Using a clock period of 10 ps it is possible to pump 1 to 
10 2 electrons onto the output capacitor within 1 ns. This number of electrons 
can already drive a CMOS interface circuit, in particular if the outputs can 
be driven in a parallel way. 
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13.2.2 Logic and Memory Circuits 

According to Fig. 13.5 the transistor toggles with the gate voltage between 
a conducting and an insulating state [65]. Two serial-connected SETs can 
exploit this effect in such a way that they operate as an inverter (Fig. 13.10). 
Depending on the input signal only one of the two SETs is in the conducting 
state. In addition, Fig. 13.10 illustrates a programmable logic gate. Via the 
control input the gate operates as a logic NOR or NAND gate. 
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In principle, it is possible to build a memory cell out of two inverters. An 
even smarter solution is a ring-shaped memory cell that only shifts electrons 
between the islands (Fig. 13.11). Similar to a dynamic memory cell (DRAM) 
the information can also be stored as charge on a capacitance [66]. 




An interesting version of such a memory cell can be seen in Fig. 13.12. The 
SET is integrated into the MOST. For high gate voltages the SET pushes a 
further electron on its island. This enlarges the threshold voltage of the MOS 
transistor. For lower gate voltages the SETs island gets discharged again. 




Fig. 13.13. SET memory chain stores several electrons 



The SET memory chain in Fig. 13.13 can be used to store several electrons 
on a single memory node. If the voltage Vq is tuned in an appropriate way the 
electrons move to the left side of the chain. On each island a single electron gets 
trapped by means of the Coulomb blockade. If n is the number of tunneling 
elements of the chain structure (n — 1) electrons can be captured by this 
method. The electrons move towards the capacitance Cq if the voltage Vq 
is readjusted. After the discharge process the voltage V ou t corresponds to the 
number of electrons that have been previously in the chain. This memory chain 
might be used for a multivalued data storage, but at present this structure is 
only used to store binary data because of the faint signals. 
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Fig. 13.14. Realization of a SET memory mash 



A realization of such a memory structure is illustrated in Fig. 13.14. Each 
tunneling element consists of silicon particles that are composed in a poly- 
crystalline stack. Every stack can store about 100 electrons, which results in 
a robust signal for the MOS transistor. 



13.2.3 SET Adder as an Example of a Distributed Circuit 



Figure 13.15 depicts the circuit diagram of a SET adder. The distributed 
circuit has been designed in a modular and repeating manner, which is of 
particular importance for nanoelectronic circuits. The input data is fed into 
the adder by means of capacitive coupling. Four individual clocks control the 
processing of the data. Figure 13.16 takes a closer look at a single adder cell. 

The nodes A and B introduce the input data. If an electron is located in 
node A it is shifted to node B during the first clock cycle. This shift is only 
possible if node B is not occupied. In this case the electron moves back to 
node A during the second clock cycle and switches the lower SET into the 
conducting state. As a consequence, the electron in node B disappears. The 
third and fourth clock cycle are reserved for the carry. This example shows 
that complex SET circuits without long wiring distances are feasible. 



13.3 Comparison Between FET and SET Circuit Designs 

The comparison between FET and SET as well as their circuit designs is a very 
interesting issue within the power-delay diagram. Although the power-delay 
diagram only offers a simplified view of both concepts the fields for potential 
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Fig. 13.16. A single adder cell of the SET adder of Fig. 13.15 



applications can be derived. Figure 13.17 shows the normalized switching en- 
ergy Ws/Wso as a function of the scaling factor a, Wso refers to the switching 
energy at a — 1. 

The switching energy of a FET decreases as the device dimensions de- 
crease, because the number of electrons in the channel as well as its capaci- 
tance scale with a factor of a 2 . This scaling procedure can only be continued 
until the last electron is left in the channel. This is the point where the FET 
model turns into the SET model. If the scaling of the physical device dimen- 
sions is continued beyond this point, the charge of the last electron obviously 
can not be scaled. Therefore, the switching energy “scales” from this point on 
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Fig. 13.17. The power-delay diagram reveals the impacts of scaling on the SET 
and FET 



with a -1 , which actually means that the switching energy starts to increase 
for smaller dimensions. 

For a specific case the FET curve in Fig. 13.17 is labeled with the number 
of electrons that are involved in each switching event. The presently used 
CMOS technology handles roughly 10 000 electrons during a single switching 
event. The thermal limit at 300 K is depicted in Fig. 13.17 for comparison 
purposes and is defined as Ws — 50 kT. As denoted in Chap. 8 this is the 
minimum energy that has to be applied to avoid thermal hazards. To operate 
these circuits at 300 K the SET device dimensions have to be very small, while 
the FET device dimensions have to be relatively large. In between these two 
extremes a save operation is only possible via cooling. 

This consideration is of fundamental relevance for nanoelectronics: For 
relatively large device dimensions the energy stored in the capacitor can only 
cope with the above-mentioned constraints if the thermal energy is reduced via 
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cooling. For smaller device dimensions the switching energy becomes higher 
than the thermal energy and no additional cooling is necessary. The increase 
of the stored energy for scaled devices is typical of quantum-effect devices 
such as superconductor switching elements. Their quantized quantity is the 
magnetic flux and will be described in the next chapter. 



13-4 Summary 

Single electron transistors offer new and interesting circuit-design concepts 
since they can be controlled by a single elementary charge. For small device 
dimensions the switching energy is so high that the SETs can be operated at 
moderate temperatures. The impact of disturbing effects such as background 
charge still remain a challenging issue. Besides further technologically oriented 
improvements the investigation of suitable logic families, architectures, and 
the development of a design framework are of fundamental relevance. 
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Superconductivity is a fascinating effect from which we assume that devices 
with ideal characteristics can be built. It turned out that the very low resis- 
tances of a switched superconductor have a negative impact on its transient 
behavior. Superconductivity is not a good candidate for further microminia- 
turizing except for some highly specialized applications due to the increase 
in switching energy with decreasing feature sizes. The reason for this effect 
is based on the fact that a flux quantum is more efficient than the electronic 
charge. Nevertheless, the research on superconducting electronics is valuable 
and important, especially with regard to nanoelectronics. 

14.1 Basics 

14.1.1 Macroscopic Characteristics 

The most popular characteristic of a superconductor is its vanishing resis- 
tance below a certain temperature, the transition temperature Tc- With the 
vanishing resistance there occurs the Meifiner-Ochsenfeld effect: A magnetic 
field is displaced out of the superconductor by surface currents, Fig. 14.1. In 
this case a superconductor and an infinitely good conducting material differ. 
The latter shows no change in the field characteristics within the conductor. 

A very important diagram for the characterization of a superconductor is 
the H—T diagram in Fig. 14.2, which depicts the transition from the normal 
conducting to the superconducting state. Applying a sufficiently large mag- 
netic field II to the superconductor switches it from the superconducting to 
the normal conducting state. This happens even far below the transition tem- 
perature. For the condition T — OK this field strength is called the critical 
field strength He. A current inside the superconductor results in a magnetic 
field, which in turn is able to change the state of the superconductor. 

The transition temperatures and the critical field strengths of some super- 
conductors as well as selected boiling temperatures and coolants are summa- 
rized in Table 14.1. 
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a) 




b) 



Fig. 14.1. Characteristics of an ideal superconductor: Resistance in relation to the 
temperature (a), displacement of the magnetic field (b) 



Table 14.1. Transition temperatures and boiling points of selected superconductors 
and cooling agents 



Superconductor Superconductor High- Tr Boiling points 

1st kind 2nd kind superconductor 



A! 


1.2 K 


Nb 


9.2 K 


YBCL 2 Cu 3 O r 92 K He 


4.2 K 


In 


3.4 K 


NbN 


16 K 


n 2 


77 K 


Sn 


3.7 K 


Nb 3 Ge 


23 K 






Pb 


7.2 K 











We distinguish between superconductors of the 1st kind, which show widely 
ideal characteristics and are applied in electronics, and superconductors of 
the 2nd kind with higher transition temperatures applied in heavy current 
engineering. The latter does not displace the magnetic field completely. High- 
ternperature superconductors, first discovered in 1983, are promising candi- 
dates for a broad range of applications but a comprehensive research activity 
is missing. New materials with higher transition temperatures are discovered 
continually. Materials with transition temperatures of T = 135 K exist. At 




Fig. 14.2. State diagram of a superconductor 
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present superconductors at room temperature are under investigation. High- 
temperature superconductors are based on ceramic material that are difficult 
to manage in electronics. 

Another important aspect for the application of superconductors are the 
boiling points of the cooling agents, for example liquid helium (T = 4.2 K) and 
liquid nitrogen (T = 77 K). High-temperature superconductors can be cooled 
in a less expensive way with liquid nitrogen. Today achieving low temperatures 
is not a problem at all. Refrigeration units of different sizes are available at 
reasonable prices but consume a large amount of space and energy. Most of 
these units contain some mechanical devices that are more susceptible to faults 
and are therefore more unreliable than integrated circuits. 



14.1.2 The Macroscopic Model 

In the 1960s the first microscopic models for superconductivity were estab- 
lished. According to the BCS theory electron pairs interact and form so-called 
Cooper pairs. The energy of a Cooper pair is smaller than those of two elec- 
trons therefore, similar to a semiconductor, a bandgap of 2 AE is formed, see 
Fig. 14.3. The bandgap is 3.5 ks Tcj and increases with the transition temper- 
ature Tc of the superconductor and with decreasing working temperature. 
Additionally, the model shows that next to the superconducting electrons 
nonsuperconducting electrons are present. 





sc 



Fig. 14.3. Band diagram of a superconductor with a bandgap of 2 AE at the Fermi 
level (a). Concurrent existence of electrons NC and Cooper pairs SC (b) 



If we apply a direct current to a superconductor only the Cooper pairs 
cause the current flow, whereas the normal conducting electrons do not move 
with the applied field, see Fig. 14.3b. Several collisions with the lattice slow 
down their movement before they are short-circuited by superconducting elec- 
trons. In the case of superconducting electrons both electrons of a pair must 
scatter at the lattice simultaneously with the same amount of energy loss. 
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This is very unlikely, therefore the resistance is zero. If we increase the cur- 
rent flow the kinetic energy of the Cooper pairs is also increased. If this energy 
exceeds the bonding energy 2 AE the Cooper pair splits resulting in a loss of 
superconductivity. Here the critical current intensity Iq is reached. 

If we apply an alternating current the behavior of superconductors is de- 
scribed with the model of London. An alternating field accelerates both, the 
Cooper pairs and the normal conducting electrons within the superconduc- 
tor, see Fig. 14.4. Collisions with the lattice cause a slow down of the normal 
conducting electrons and increases the energy of the lattice. Taking this into 
account the equivalent electric circuit in Fig. 14.4b has two branches, one with 
only an inductance to represent the superconducting electrons and another 
with an additional ohmic resistor for normal conducting electrons. 



a) 




b) 
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Fig. 14.4. Behavior of superconductors when applied to alternating current: Density 
of both types of electrons in relation to the temperature (a), equivalent electrical 
circuit (b) 



The quality factor of a resonator built of superconducting devices is not 
perfect because both types of electrons are responsible for the effective current 
flow when operated with alternating current. Nevertheless, the quality factors 
of superconducting resonators are better than those of nonsuperconducting 
resonators, which makes them valuable in practice. This characteristic has 
also a beneficial effect for superconducting interconnection lines on integrated 
circuits. 

14.2 Superconducting Switching Devices 

The following deals with different superconducting devices from past to 
present to give a first impression of the development in this area. 

14.2.1 Cryotron 

A simple device with present minor relevance is the cryotron as depicted in 
Fig. 14.5a, 
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Fig. 14.5. Structure and mode of operation of a cryotron (a) and a Josephson device 
(b) 



A control line with a high transition temperature crosses another line. At 
the junction the line can be switched between the superconducting and normal 
conducting state by applying a control current. Usually the lines are made of 
lead and tin and are operated just below 7 the transition temperature of tin. The 
permanently superconducting control line (lead) can toggle the underlying line 
(tin) between the superconducting and normal conducting state. Therefore, 
a cryotron equals a switch that controls the resistance between zero and a 
limited (small) resistance. 

The drawbacks of the cryotron are its low 7 performance due to a phase 
transition, the difficulty to microminiaturize the device, and the quite small 
resistance in the normal conducting state. 



14.2.2 The Josephson Tunneling Device 

The drawbacks of the cryotron are mostly avoided by the Josephson device. 
In relation to Fig. 14.5b an additional tunneling device at the junction of both 
lines is applied. A 3 nm thick tin-oxide layer as a tunnel barrier is placed on 
the wire made of tin. Similar to a semiconductor tunnel diode the voltage- 
current characteristic depends on the bandgap of the structure. 

The Cooper pairs tunnel through the barrier, which is equal to a super- 
conducting junction. Therefore no voltage drops at the tunneling device, see 
Fig. 14.6. When reaching the critical current the bias point moves to the right 
and the voltage drop across the tunneling device is proportional to the band 
gap Vs = 2 AE/q. The voltage remains constant even if the current decreases. 
The device returns to the superconducting state if the current is reduced to 
zero. 

If tunneling occurs within the right side of the characteristic the Cooper 
pairs deliver energy by means of photons, depending on the width of the 
bandgap. In this area, high-frequency oscillations within the Josephson device 
occur. Their frequency is proportional to the voltage drop across the tunneling 
device, (14.1). 
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Fig. 14.6. ( a) V-I characteristic of a Josephson device, (b) state-chart of the device 
in presence of a magnetic field (b) 



/ = y V s - ( 141 ) 

Such devices can be used as oscillating circuits and are interesting in the 
field of neural networks with chaotic behavior. The voltage can be precisely 
controlled hj the frequency, hence we can use this device as a voltage standard. 



a) 





magnetic field 





Fig. 14.7. Current distribution inside a Josephson device with regard to a magnetic 
field controlled by a current. The magnetic field increases from (a) to (d) 



If a magnetic field is applied to the tunneling device a transition to a 
high-resistance state occurs and the current disappears. First, the current 
density at the tunnel junction is uniformly distributed. When increasing the 
magnetic field the tunnel junction reduces the current. This effect is due to 
the interaction of tunneling Cooper pairs with the flux quanta. If the current 
distribution comprises a full wavelength, the net current density is zero and the 
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current through the tunnel junction disappears. In this case the magnetic flux 
through the tunnel junction is equivalent to one flux quantum. If the magnetic 
field increases, additional flux quanta can penetrate into the tunnel junction, 
which results in further switching operations in accordance with the number 
of flux quantum, see Fig. 14.7. The maximum peaks become smaller because 
more wavelengths determine the net current. The minima of the current occur 
at multiples of the magnetic flux quanta. These flux quanta represent the data 
signals within the SFQ- logic. Due to the absence of a phase transition of the 
material the switching time is very short (some picoseconds) . Additionally, the 
resistance is relatively high. The last-mentioned are advantageous compared 
to the cryotron, which results in the displacement of cryotrons in electronics. 



14.3 Elementary Circuits 

In the following several basic circuits with superconducting electronics will be 
presented. 



14.3.1 Memory Cell 

Due to the permanent current flow 7 in a superconducting ring the implemen- 
tation of memory cells seems to be very attractive. A memory cell can be 
composed of a superconducting ring where the states are represented by a 
current flow or not. A circuit technique can use a Josephson device or a cry- 
otron for composing a switch as shown in Fig. 14.8. 

Mode of operation: If a current Iq is applied to the ring the current is 
shared between the two branches in the ratio of the two inductances L\ and 
I/ 2 , (14.2). 
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(14.2) 



The necessary condition for this operation is that the magnetic flux inside 
the ring is zero. 




Fig. 14.8. Memory cell: If the information is stored the current flow occurs only in 
the right branch. Not until the externally applied current is switched off does a ring 
current begin to flow, I\ = —1 2 
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If the cryotron is switched to the normal conducting state the total current 
flows through the other branch: I 2 = Iq. When switching off the cryotron the 
condition is maintained. If the current Iq is turned off the current distribution 
in the ring changes in such a way that the magnetic flux is kept constant 
inside the ring. 



h = ~h = T ~~ TT ~I 0 - (14.3) 

Lj\ + Lj 2 

The information can be sensed destructively by switching on the cryotron. 
The current and the magnetic flux decay and a voltage, which can be measured 
externally, is induced. The information is nondestructive^ readable if the ring 
current is sensed by an additional cryotron in a special read wire. 



14.3.2 Associative or Content-Addressable Memory 

Storing of information in an associative memory cell is equal to the above- 
mentioned case. An additional wire composed as a ring wire acts as a switch 
for controlling the information processing. If a current flows in this branch 
an ohmic resistance occurs inside the switch, with no current flow the switch 
remains superconducting. To store an association the corresponding data sig- 
nals are externally applied to both rings. If the applied signals match with 
the stored signals the current inside the branch is zero and the switch remains 
superconducting. If the signals match within a whole word the entire wire is in 
the superconducting state. Superconductivity is advantageous when sensing 
the information. Among a great number of data lines the current is able to 
detect the superconducting line and therefore the associating word. The zero 
resistance is an outstanding characteristic that can yield a large associative 
memory. 



h 





h 



Fig. 14.9. Memory cell: Only a current flow inside the left branch activates the 
switch in the crossing wire (thin line) 



The switching performance shows a fundamental difference from those of 
MOS circuits. One of the main interests in the design of transistor circuits 
is a fast recharge of the internal capacitances, i.e. to minimize the resistance 
R in. the time constant r = RC. In terms of superconducting electronics 
the current has to decay as fast as possible, as shown for the memory cell 
mentioned above. In this case the time constant is r = L/it, i.e. the larger 




14.4 Flux Quantum Device 233 



the resistance the smaller the time constant. This condition can not easily be 
fulfilled with superconducting electronics. 

The vanishing resistance (R — > 0) has a favorable effect. A branch with 
R = 0 takes over all the current after a certain amount of time, even if other 
branches have a resistance near zero. This behavior is a good prerequisite for 
a current-controlled circuit technique. To suppress unwanted reflections addi- 
tional resistances have to be applied for the termination of superconducting 
wires. Due to these ohmic resistances the power dissipation of a superconduct- 
ing wire is not zero. 



14.3.3 SQUID - Superconducting Quantum Interferometer Device 

A SQUID (superconducting quantum interferometer device) is a well-known 
basic circuit consisting of a superconducting ring with two Josephson tunnel- 
ing devices as shown in Fig. 14.10. Due to the magnetic flux a coupling and 
therefore an interference exists between both Josephson tunneling devices. 
This effect shows quantum characteristics in a macroscopic structure. 





Fig. 14.10. SQUID: State diagram of two parallel-connected Josephson elements 



Taking the characteristic of Fig. 14.7 one can derive the characteristic of a 
SQUID as denoted in Fig. 14.10. If the applied field in one element is increased 
by the control current it is reduced by the same amount in another element. 
The superposition of both characteristics results in a regular characteristic 
in the state-chart. This device can be used as a memory cell. Due to the 
sensitivity of the device another application is the precise measurement of 
magnetic fields. 



14.4 Flux Quantum Device 

Superconducting memory cells and logic circuits can be realized in such a way 
that only one magnetic flux quantum is moved per operation. Taking this is 
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into account we get quantum electronics with superconductors. The size of a 
flux quantum is only 2 mV ps, hence the switching time of the gates must be in 
the domain of picoseconds if sufficient voltage signals are to be achieved. Such 
circuits have been made and tested with frequencies up to 100 GHz. The high 
performance is a crucial advantage for this kind of superconductive circuits. 

14.4.1 LC-Gate 

An interesting approach for superconducting coils is the so-called LC logic. 
The basic structure is an oscillatory circuit as shown in Fig. 14.11. A switch 
can short-circuit the coil or split off the capacitor. 

Such an arrangement can be realized with superconducting devices. First 
we consider the case that the information is stored as charge inside the ca- 
pacitor. If the switch is closed the electrical energy is converted into magnetic 
energy and back again into electrical energy. With it the voltage changes its 
sign. If the switch is opened when the current is zero the circuit changes to 
the other binary state. It seems that no energy is needed during the change of 
its state. A very similar process occurs if we look at the magnetic flux inside 
the coil, see Fig. 14.11b. 

Is there really no energy loss in this case? Energy is needed for measuring 
the voltage and the current inside the ring and secondly for the operation of the 
switch during the passing through zero as described in Chap. 3. The amount of 
energy for this is at least kT In 2. Therefore the LC logic is lossy. The relative 
loss of energy increases with decreasing size of the LC logic. Our considerations 
are true until we reach the quantum-mechanical limits where the flux quanta 
and the elementary charge can be seen as packages. Interestingly, the LC logic 
is nowadays under discussion for low-power CMOS. 

14.4.2 Magnetic Flux Quantum - Quantum Cellular Automata 

A basic circuit that transfers quantum packages are quantum cellular au- 
tomata (QCA) with magnetic flux quantum SFQ. The configuration of these 
devices equals the QCA devices with electrons mentioned in Chap. 10. In this 
case the electrons are replaced as shown in Fig. 14.12 b}^ flux quanta in two 
fields of the four potential wells. 

The flux quanta can be moved by the application of external magnetic 
fields. Due to energetic conditions adjacent flux quanta move in opposite di- 
rections. Circuits of this kind exhibit a relatively slow performance. Josephson 
tunneling devices are more attractive because only one flux quantum is needed 
for changing the state of the circuit. 

14.4.3 Quantum Computer with Single-Flux Devices 

Another interesting approach realizes Qubits with Josephson devices in su- 
perconducting rings. Such devices have been successfully built by Mooji from 
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Fig. 14.11. LC logic, (a) storage of energy in an electric field, (b) storage of energy 
in a magnetic field 
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Fig. 14.12. QCA with magnetic flux quanta. An applied field can change the posi- 
tion of the magnetic flux quanta 

the University of Delft. Here the two states of a Qubit can be expressed with 
two opposite flux quanta or with two ring currents of opposite directions. The 
advantages of this technology are the large feature sizes of about 1 fim and 
the possibility to read-out the devices electrically. This simplifies the problem 
of the I/O interfacing. 

If a few such devices are joined together we get a Qubit chain that can 
perform the operations mentioned in Chap. 5. They can be operated as clas- 
sical memory cells (Cbits) by slightly modifying their arrangement. In this 
way an interface to a quantum computer can easily be realized. Figure 14.13 
shows the structure of such a computer where the superconducting functional 
units are controlled by a conventional computer through the Cbit devices. In 
this case quantum effects occur at the macroscopic level, which is essential for 
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Fig. 14.13. Architecture and Qubit element of a superconducting computer 



the quantum computer. Therefore the superconducting technology might be 
a good candidate for such computers. 



14.4.4 Single Flux Quantum Device - SFQD 

A Josephson device stores single flux quanta that can be used for infor- 
mation processing. A possible implementation of such a circuit is depicted 
in Fig. 14.14. A transmission line with a characteristic wave impedance of 
Z « 10 D is connected in parallel to a Josephson tunneling device. A bias 
current forces the circuit to the initial state ‘O’. An additional impulse raises 
the current through the Josephson device above its critical current. The tun- 
neling device switches and the operating point moves to Ah If the impulse 
vanishes the device returns near to the point c 0 5 without reaching its initial 
state. Only if the bias current is turned off, will the device return to its initial 
state. Due to this complex switching process this circuit is comparatively slow. 
This technique has been investigated in the 1970s. 

If the impulse is very short, only a single flux quantum is necessary for 
the operation. The voltage integral over the time about the resulting voltage 
impulse is equal to one flux quantum, i.e. the impulse height is 1 mV and the 
impulse width 2 ps. In this borderline case only one flux quantum is used, 
which is expressed in the name SFQD - single flux quantum device. Taking 
this idea as a starting point Likarev proposed the RSFQ logic. 
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Fig. 14.14. Transformation of a few or a single flux quantum into a voltage impulse 



14.4.5 Rapid Single Flux Quantum Device » RSFQD 

A dynamic SFQD is the so-called rapid SQFD as shown in Fig. 14.15. The main 
difference to the SFQD is the use of an inductance instead of the transmission 
line. Later a second Josephson device is connected in parallel to the first one. 
If the first tunneling device changes its state due to an applied impulse the 
switching process differs from that above-mentioned. The second tunneling 
device takes over the current delayed in time (effect of the inductance) and 
prevents the complete switching of the first tunneling device. The circuit is 
strongly damped to prevent overshooting. 




Fig. 14.15. Transformation of a single flux quantum into a voltage impulse 



Assuming correct sizing of the devices, only one magnetic flux quantum 
is moved during this operation; it can be sensed at the output node. After 
completing the operation the circuit returns to its initial state. Due to the 
avoidance of the normal conduting mode the circuit is very fast. 

Figure 14.16 shows a circuit that transfers a direct current signal in an 
succession of impulses, where each impulse corresponds to a single flux quan- 
tum. Applying a direct current the major part of this current flows through 
the Josephson device J 3 . If the Josephson. device switches, the current distri- 
bution inside the circuit changes. The device J 3 generates an impulse with 
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the quantity of one flux quantum. Finally, it returns to the superconduct- 
ing state and the initial condition is restored. Instead of the inductance and 
the tunneling devices a transmission line with distributed tunneling devices is 
used. 

Both Josephson tunneling devices are coupled through, a superconducting 
inductance, which minimizes the amount of energy during the switching pro- 
cess. Energy is transmitted in the form of one flux quantum. The flux quantum 
itself is linked to a pair of electrons, therefore no energy is dissipated. 



14.5 Application of Superconducting Devices 

14.5.1 Integrated Electronics 

As with transistors logic gates and other digital functional elements can be 
composed with cryotrons or Josephson devices. The Josephson devices require 
a new circuit technique with many special features similar to those of circuits 
fabricated in GaAs. Since the invention of the Josephson device in the 1960s 
the technology has been improved so that large integrated circuits can be 
realized. As known from the silicon planar technology this technology is also 
bases on the deposition of thin layers on a silicon substrate. A commond choice 
as conducting material is niobium. Isolation layers can be manufactured by 
evaporating silicon oxide (SiO) or by depositing silicon dioxide (SiC^). The 
isolation layer inside the Josephson device is made of aluminum oxide with a 
thickness of only a few nanometers. Corresponding to the planar technology 
all terminals are located on the surface of the chip. 

Many of the large-scale integrated circuits are made for the application 
of radar engineering and EMC. Their main characteristics are high-frequency 
operation and low-power consumption together with a high precision. The 
clock frequency rate is up to 40 GHz. Another project was the implementation 
of a 4 bit microprocessor with 24 000 Josephson devices. With an average 
feature size of 1.5 fim an area of 25 mm 2 is occupied. Without any optimization 
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with regard to the packing density the circuit can be operated at a frequency 
of up to 1 GHz. The necessary memory chips are designed with a capacity of 
4 kbit. A memory chip of this kind occupies an area of 36 mm 2 and exhibits 
an access time of 0.58 ns. 

Electrical circuits can also be realized with high-temperature supercon- 
ductors. For example a shift register made of 32-512 YBaCuQ operates at a 
temperature of 77 K at up to 100 GHz. Due to the high working temperatures 
and the small switching energies special steps have been taken to reduce the 
failure rate due to thermal fluctuations. At this temperature silicon circuits 
can also be operated, which allows the integration of CMOS interface circuits 
at the same time. According to the principle of RSFQ devices different logic 
gates and more complex gates can be realized. Systems composed of these cir- 
cuits allow a high-speed operation with clock frequencies of up to 900 GHz. 

A MOS device operating at low temperatures is interesting for various rea- 
sons: The increased mobility of charge carriers due to the reduced collision rate 
with lattice atoms. Due to decreased diffusion and recombination processes 
the inverse leakage currents are reduced. The decreased voltage equivalent of 
thermal energy reduces the Debye length. A temperature of 77 K can easily be 
achieved with liquid nitrogen. Therefore the application of MOS electronics is 
beneficial. Despite all these improvements the effort for cooling the circuits is 
only useful for large data-processing systems. 



14.5.2 FET Electronics - A Comparison 



The most interesting approaches concerning the nanoelectronics are the single- 
electron transistor and the single flux quantum device. Both devices are work- 
ing with single quanta hence the borderline of classical electronics is reached. 

Thus, the following comparison is of fundamental importance for nano- 
electronics. Inside the superconducting electronics the switching energy per 
switching operation is set by the current and by the inductances. Besides the 
switches, inductances are the most important devices. The energy is deter- 
mined by 



Esc = LI 2 = 



( til $of 
L 



( 14 . 4 ) 



Inside the field effect electronics the main parameters are determined by 
the voltage and the capacitors. Charging of a capacitance needs the energy of 



Esc = CV 2 = ( 14 . 5 ) 

O 

In (14.4) and (14.5) the energy depends on the number ul of flux quanta 
and on. the number nc of electrons. If the error probability caused by thermal 
noise is equal in both cases the switching energy must exceed the thermal 
energy kT by orders of magnitude. One has to remember that the operating 
temperature of both circuits differ considerably. The condition for equal error 
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probabilities can be derived by using the relationship of nc to til according 
to 



n c _ / C Th 

til Q V L Tl 



(14.6) 



The term yL/C can be replaced by the wave impedance Z. Introducing 
the free-space impedance Zq we get 



qZo 

Tq 



4a. 



(14.7) 



Taking all into account we obtain 



nc ___ 1 Zq ITh 
ul 4 a Z V Tl 



( 14 . 8 ) 



If we take typical wave impedances between 3 ----- 30 Q a resistance ratio 
between 10 — 100 is obtained. The ratio of the temperatures is about 100. With 
it the ratio of the elementary quantities is around 10 5 . On the assumption that 
both devices are of equal size and with equal error probabilities a switching 
operation needs 10 5 times more electrons than flux quanta. 

If a device is scaled down by a and assuming til > 1 the switching energy is 
reduced by a -3 . If til — 1 the switching energy is proportional to a. The last- 
mentioned is valid for the actual dimensions in superconducting electronics 
so the switching energy is increased when reducing the size of a device. A 
switching operation with field effect electronics needs more than 100 charge 
carriers, therefore the switching energy decreases as a 3 with the exception 
of single-electron transistors. In both cases the switching times are reduced 
by a -1 . Superconducting electronics consume more energy, which makes it 
superior to FET logic, which describes the highest possible clock frequencies. 




Fig. 14.17. Switching energy of FET and SC electronic with regard to the feature 
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Superconductivity is not the best candidate for the development of nano- 
electronics, because the development of nanoelectronic circuits is mainly set 
by parameters like the energy dissipation. On the other hand, the standby 
power dissipation is zero for superconducting electronics, which is of great 
importance even in large systems. 

Interestingly the use of superconducting electronics does not result in a 
saving of energy. The power dissipation of a circuit is Ey ------ rriykTL. The 

variable my is a proportionality factor that includes the complexity of a circuit 
and the quantity of the switching energy. To cool a superconducting circuit 
from room temperature Tjj to a lower temperature Tl a refrigerating machine 
must be applied. The machine needs the energy according to the theorem of 
Carnot 

Tn - Tr 

E m — E v H L . (14.9) 

J- L 

The total power dissipation is then set by 

•Bail — Em + Ey = my kT H . (14.10) 

The total power dissipation is the exact energy the circuit would consume 
if operated at the temperature Th and assuming equal noise immunities con- 
cerning thermal fluctuations. With regard to the power dissipation supercon- 
ducting electronics show a beneficial effect where no extra cooling is needed, 
e.g. space travel. 

The transition from superconducting electronics to roomtemperature elec- 
tronics is not easy. A good conductor for electrical current is also a good 
conductor for heat. To get an acceptably high thermal resistance the conduc- 
tor has to become relatively long or highly resistive. Both conditions deteri- 
orate the signal transfer time and the signal attenuation. A remedy for these 
problems are optical fibres on the condition of appropriate interface circuits. 

Another topic is the reliability of such circuits. Electronics operated at 
low temperatures is resistant to aging due to reduced diffusion processes. On 
the other hand, repeated cooling and heating leads to a reduced durability. 
The difference between the expansion coefficient of the materials causes high 
mechanical stresses. 



14.5.3 The Electrical Standards 

The investigation of quantum electronic devices is interesting because the 
‘transistor’ parameters are set by fundamental constants. Therefore no pa- 
rameter scattering exists. Nevertheless, these effects are more important for 
standard measures than for electronics. 

At the moment the following effects can be used for standard measures: 

1. A sinusoidal voltage applied to a single-electron transistor causes a cur- 
rent I that is the product of the elementry charge and the frequency. The 
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Fig. 14.18. The triangle of the quantum measurements for electrical standards: 
voltage, conductance and current 



oscillations of atoms can be used for the determination of physical quanti- 
ties like time and frequency. Since the frequency can be controlled exactly, 
such devices can be used as a current standard. 

2. A Josephson device can be used as a voltage standard. In this case a 
voltage V adjusts a frequency, which in turn can be measured exactly. 

3. The quantum Hall effect (von Klitzing) combines the quantities voltage 
and current on the basis of a fundamental constant resulting in a standard 
for the electrical resistance R — 1/G. 

Figure 14.18 summarizes the three basic effects with regard to the standard 
measures. Most of these effects are interesting for metrology but will gain more 
interest for nanoelectronics. Their main advantage is the lack of parameter 
scattering. 



14.6 Summary 

Although superconductors exhibit some attractive and interesting character- 
istics their application in the field of information processing is limited and 
restricted to some special cases. This is due to the necessity of the very low- 
temperature ranges that are inherent to superconductivity. Another serious 
limitation comes from the high action of a flux quantum. Further microminia- 
turizing will reach the limits of quantum mechanics with the characteristics 
of high power dissipation and short switching times. Nowadays the direction 
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in the development of superconducting circuits goes towards very high speed 
information processing in the area of low temperatures, e.g. space electronics. 




15 



The Limits of Integrated Electronics 



15.1 A Survey about the Limits 

The following describes the most important limits that will restrict the devel- 
opment of micro- and nanoelectronic circuits [45]. The physical limits are of 
a fundamental nature and can not be overcome. This is true, with the reser- 
vation that the limits as seen today are not necessarily final ones because the 
research in physics continues. We can expect considerable progress in the field 
of quantum computing , which will move some limits far beyond current ones. 
Some of these physical limits that are presented for classical microelectronics 
are summarized below. 



physical 

limits 



t2 



tl 



limits set by complexity 




acceptance 



limits set by 
technology 

Fig. 15 . 1 . Some limits of micro- and nanoelectronics ME/NE. Further develop- 
ment enlarges the circle ME/NE, so the borderline can reach, different limits. The 
technological limit is indicated for the times tl and t2 
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As seen in Fig. 15.1 before the circle reaches the physical limits technologi- 
cal limits are reached. These limits address topics like device dimensions, chip 
area, and power-dissipation density [2]. The technological limits are time de- 
pendent and can be moved to better values with the development of advanced 
technologies. On the other hand, the development of new technologies can be 
too expensive, so economics would restrict further developments. 

Thermodynamic effects have a great influence on the characteristics of 
microelectronic circuits. Due to their thermal energy the particles in a solid 
are in constant motion. Similar to the spread of an ink drop when it is put into 
water the concentration gradients inside a solid vanish after a certain period 
of time. During the compensation diffusion currents appear. Additionally the 
concentration distribution is subject to fluctuations. All these processes affect 
the characteristics of a device, i.e. they determine parameters like current flow, 
noise, heat conduction, doping profiles, parameter spread, and reliability. 

In the 1980s one limit was set by the complexity of microelectronic devices. 
Against all apprehensions this limit could be overcome. New approaches in the 
field of mathematics and computer sciences have pushed and will probably 
push this limit further away. 

Sometimes the more important limits are set by social constraints. If prod- 
ucts are not accepted by the market further development will stop. Products 
that are assumed to damage our health, e.g. products that emit radio waves, 
can limit their acceptance on the market. Their acceptance also depends on 
the costs of the development. If these costs are too high or it is only assumed 
they could not be paid, the development will be stopped. In all these cases the 
acceptance of a new technology as a limiting factor is of great importance. 

The reflection upon the limits are essential for a company. A product 
should be optimized permanently if a good market position is required. System 
performance and reasonable prices are prerequisites for a successful product. 
The problem during the development of a product is the permanent change of 
technical and economic conditions. The solution of this optimization problem 
can be simplified by knowing the limits of the technolog}^. They indicate the 
available scope of the development of that technology. 



15.2 The Replacement of Technologies 

Business management is of crucial importance in the tremendous improvement 
of microelectronics. The costs of a transistor or a device have been reduced 
by orders of magnitude over the past years. While the wafer costs stay mainly 
constant integration density has changed enormously, see Fig. 15.2. The yield 
increases due to a reduction of the failure rate and parameter spread. Under 
these conditions the introduction of a new technology, results in lower costs 
which in turn drives the further development. From todays perspective the 
development of silicon technology continuous up to 2010 without reaching 
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any degree of saturation. At this time new technologies with a certain degree 
of ripeness are launched to market. 

The reduction of costs is possibly due to the present human and material 
resources. Due to the number of competitors, who are subject to the same 
economic and social conditions, there is an obligation to take part in the 
development of new technologies if one aims to be a leader in one field of 
technology. This success is necessary due to the high costs of a chip foundry. 
At the moment only a few ‘nations’ are able to invest in such technologies, 
e.g. Japan, US, and Europe. Because of cost savings the main activity centers 
are shifting to the Far East. 

An alternative is the change of huge chip foundries to a large number of 
decentralized small ones. The small chip foundries can satisfy the needs of 
the customers. How far this approach is successful from the technical and 
economical point of view is still not certain. More probable are small, flexible, 
and computer-controlled fabs with fast and individual custom design of ICs. 

In the future, nanotechnology may create new technological approaches 
that are cheaper than today’s silicon technology. Self-assembling of devices 
without an expensive lithography may play an important role for the emerging 
technologies. In this case the replacement of technologies with improved ones 
can be done sooner. 




Fig. 15.2. Costs per device. The limit of silicon technology is expected for the 
year 2010. After this, new technologies will emerge (a). If these technologies are 
cheaper the change of a technology occurs sooner (b). The choice of technology is 
still uncertain (?) 
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15.3 Energy Supply and Heat Dissipation 

For microelectronics, and in future for nanoelectronics, heat generation in 
integrated circuits is of great importance. Heat conduction can limit the per- 
formance of a system. The following idea can be used as a starting point, see 
Fig. 15.3: A. functional block contains 10 12 nanostructured switching elements 
in a volume of 1cm 3 . If each element requires a switching energy of 1 f J and 
is operated at a clock frequency of 1 GHz a switching current of 1 000 000 A. 
flows through the block with the dimension of 1 cm. If the devices are operated 
at a voltage of 1 V the power dissipation is 1000 kW. Assuming a switching 
energy of 1 aJ, which seems to be attainable for nanoelectronics, the switching 
current is still 1000 A. If only 0.1% of the devices are active at the same time 
the switching current is 1 A. This idea shows the important role of a good 
system design concerning aspects like heat conduction and low standby power 
dissipation. Otherwise the obtained values are difficult to manage in technical 
systems. 



/ 




nanoelectronic system, volume 1cm 3 

Fig. 15.3. The problem of the power supply of a system with 10 12 devices 



Heat conduction is a diffusion process. The power that can be dissipated 
as heat can be expressed with the diffusion equation: 



P,„ = -*g. (15.1) 

Pth is the power of the heat flow, T the temperature, and k denotes the 
thermal conductivity, which corresponds to the diffusion constant. Assuming a 
device with the temperature T\, the thermal resistance of R t h, and an ambient 
temperature T 2 we can write (15.2) according to Ohm’s law: 



P th = 



(Ti - T 2 ) 

Rth 



(15.2) 



The ambient temperature and the maximum working temperature are 
given, so the thermal resistance Rth determines the heat flow, i.e. the heat 
dissipation. 
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The power dissipation has its originate in the transistor itself. From there 
the power dissipation or heat has to pass through the chip and the module 
before the working environment is reached. Next to the improvement of micro- 
electronics the heat conduction could also be improved, as denoted in Fig. 15.4. 
At the beginning the improvement was due to improved printed circuit boards 
and packaging. Heat conduction Pm could be increased to lW/cm 2 . The ap- 
plication of a ventilating fan increased this value to 10 W/cm 2 . The next step 
in the development was fin cooling where the chips on a module are cooled 
with water from the backside. The backside exhibits a number of grooves that 
are fabricated by micromechanical structuring. This results in a Pm of about 
100 W/cm 2 , which is a typical value for high-performance microcomputers. 

During the design of a microelectronic system the context between heat 
conduction Pm of a module and the technologj^-dependent power-delay prod- 
uct of a gate is of great importance [45]. This context is explained with the 
help of the following simple derivation. Let us assume a module containing n 
integrated devices as shown in Fig. 15.5. The module area Am containing the 
chip must be large enough to emit all the heat dissipated by the active device 
n a on the chip and can be expressed by 

Am Pm > Pvan a - (15.3) 



This assumption limits the maximum temperature of the chip. The signal 
delay t m between two chips should be smaller then the delay time td of a gate. 
This leads to 



td > m M 



1 



\J A m vm 



(15.4) 



The term tum is empirical and vm denotes the propagation speed of the 
signals inside the module. If Am is eliminated from (15.3) and (15.4) we get 
the following: 
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This equation determines the limit inside the power-delay diagram of 
Fig. 15.6 due to heating. This diagram contains the characteristics of different 
integration levels and for a heat conduction of 100W/cm 2 . The higher the 
delay time the higher can be the switching energy. 

Another limit follows from the heat dissipation of a driver circuit to the 
chip, see Fig. 15.5. The power dissipation Pvl of a power gate may not exceed 
the maximum allowable heat dissipation of a chip. The heat inside the chip 
should be uniformly distributed, 

Ac Pc A Pvl • (15.6) 

Once again m c is determined empirically, C is the distributed capacitance 
and -\/Ac the assumed wire length. If the chip area is eliminated we get 

PviAdl — me ^ V j . (15.7) 



Pvl tjjL < 



1 m 2 c C 2 Vj 

___ __c 



(15.8) 



If we equate the switching time toL of the driver with the gate delay Id 
the limiting condition of (15.8) can be drawn on a P\ — to chart. 




Fig. 15.5. Heat dissipation on a module and on a chip 



A fundamental restriction of aggressive scaling could be the heat conduc- 
tion especially in those cases where the power dissipation per unit area in- 
creases. Today this problem is not of great interest in information-processing 
circuits. Nevertheless it is an important topic for power stages. 
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If these reflections are passed onto integrated circuits and taking the 
switching time into account we get the chart of Fig. 15.6. This chart con- 
tains the limits for field effect transistors. The removal of heat is actually 
restricted to values around 100W/cm 2 . The more devices integrated on a 
chip the lower must be the switching energy. Lowering the switching energy 
causes smaller dimensions, therefore the performance of the devices increases. 
On the other hand, the absolute limit of lowering the switching energy is set 
by thermodynamics of about 50 kT. The uncertainty relation, leads to the 
quantum-mechanical limit, which is important for single-electron transistors. 




1(T 15 KT 11 1(T 13 1(T 9 1(T 7 

t/s 



300 K 



Fig. 15.6. Switching energy versus delay for SET and CMOS devices 



Also important are the consequences of further scaling down the energy 
supply of the circuits. Even if scaling reduces the supply voltage to V/a , the 
currents on the supply lines increase due to an increase in the number of 
switching devices. From this the well-known problems for power supplying 
integrated circuits with low voltages and high currents occur. Three typical 
effects arise: Due to high currents a voltage loss on the supply lines occurs. The 
losses have a great influence when operating the circuits at low voltages. A 
remedy for this problem are distributed power-supply lines. The high current 
densities can cause a migration of material which in turn leads to circuit 
failures. Due to unavoidable line inductances each current variation causes a 
voltage impulse on the supply lines, which in turn can reduce the reliability of 
the system. Remedies for this problem are supply lines with a lower inductance 
per unit length, blocking capacitors, and circuit techniques that avoid high 
current pulses. 
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15.4 Parameter Spread as Limiting Effect 

Due to the diffusion of particles, layers with different electrical characteristics 
can be built. On the other hand, diffusion is not a equally distributed process, 
which can cause problems inside an electrical circuit. A very important appli- 
cation. of the diffusion process is the production, of integrated circuits. As an 
example the production of a pn-j unction in a planar technique is explained. 
The starting point is a p-type substrate that is covered with a Si02 layer. 
The part of the Si02 layer that is opened by an oxide window is covered with 
n-type phosphorus atoms, see Fig. 15.7a. If this structure is exposed to temper- 
atures above 1000° C the phosphorus atoms diffuse into the p-type substrate, 
which results in a doped layer with a different polarity. In addition, Fig. 15.7 
shows that some of the particles diffuse under the adjacent oxide (underdiffu- 
sion). When the diffusion process is maintained the layer continues growing. 
The doping profile depends on the time and is shown in Fig. 15.7b and can 
mathematically be formulated by (15.9). 




Fig. 15.7. Process of diffusion for the realization of semiconductor structures, (a) 
Diffusion process in a semiconductor and (b) the diffusion profiles as a function of 
time 

In (15.9) Q denotes the initial density of phosphorus atoms and D the 
diffusion constant. If the temperature is lowered the diffusion process does not 
stop but reduces its speed. This also indicates that the electrical parameters 
that are adjusted during the manufacturing process can vary with time and 
can sometimes cause device failures. Therefore the diffusion process is very 
important when the reliability of a device or system is of concern. 

The equal-probability distribution of (15.9) is only a mean value. The other 
values of each pn-j unction vary around this value. This results in a spread 
of the electrical and technological characteristics of the devices. This is a 
fundamental effect for microelectronics and is mostly detrimental to integrated 
circuits. 
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In the following we estimate the parameter spread of the diffusion process 
with the help of a simple example. The starting point is a volume V that 
is divided into N cells each containing n particles. We now ask about the 
probability P that n l particles stay in one half of the volume. From experience 
we know that on the average n ! equals nj 2. Due to thermal motion sometimes 
more and sometime less than nj 2 particles are inside half of the volume. To 
derive the spread mathematically we calculate the number m of possibilities 
to put n ! particles in one half of the volume. Eqation (15.10) can be derived 
with methods of combinatorial analysis: 



m (n / ) — 




In the case of n — N/2 (15.10) can be simplified to: 



(15.10) 



m(n f ) = (15.11) 

n ! 

This equation is evaluated for a very small number of particles N in 
Fig. 15.8. The higher the number of cells and particles the higher the num- 
ber of possibilities. The maximum number of possibilities occurs if half of the 
particles are inside one half of the volume. This state occurs with the highest 
probability, which agrees with our experience of technical systems. 

If the characteristic curves are standardized on the maximum value of 
n/2 we get the chart of Fig. 15.8b. It follows from this that the normalized 
standard deviation decreases with an increasing number of particles, 



a /0.5iVf (1-f) _ ! 

/* 0.5 N§ W 



(15.12) 



This result explains why the spread of the values can be neglected in the 
field of classical thermodynamics. According to the Avogadro constant L one 
mole consists of 6.022 x 10 23 molecules. At the temperature of 0°C and pressure 
of 10 5 Nm~ 2 the volume is 22 414 cm 3 . In this case the scattering region is 
about 10“~ 10 . 

The relations in microelectronics are quite different. If (15.12) is applied 
to the density of a doped region in a device and if we calculate the spread due 
to the diffusion process we get the numerical examples of Table 15.1. 



Table 15 . 1 . Numerical examples 



Relevant volume 


1 cm 3 


1 pm 3 


0.1 /im 3 


Mean number of doping atoms 


icF 


To 3 


10 


Normalized standard deviation 


Tin® 


10"" = 1% 


10”°' 5 = 30% 
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m = 1 



4 



6 



4 



1 




Fig. 15.8. Number m of possibilities to place four particles according to (15.11) (a) 
and for n particles for m! particles per volume (b) 



This example demonstrates that with decreasing volume and therefore 
with a decreasing number of particles the normalized scattering increases. 
This spread influences various parameters of a device. The threshold voltage 
of MQS transistors is mainly set by the doping concentration of the space- 
charge region under the gate. However, the number of doping atoms decreases 
in the space-charge region only with a power of 2 and not with 3 according to 
the scaling factor. Nevertheless, this effect seems to become more and more 
important when scaling down the devices, maybe this is an insuperable limit 
of classical microelectronics. 

Equation (15.13) denotes the depth of the space-charge region depending 
on the surface potential: 




15.4 Parameter Spread as Limiting Effect 255 




The threshold voltage of a MOS transistor is 

s/q2eip N A 

V T = ~ • 



(15.13) 



(15.14) 



The variation of the threshold voltage depends on the change of the doping 
concentration: 



d Vt d N u 4 

= ~Nl 



(15.15) 



According to (15.12) the variation region can be written as follows 



AN a ___ 1 

Na ~ VnJx* 



(15.16) 



Equation (15.16) is inversely proportional to the root of the number of 
doping atoms. With, this relation we can reformulate 



Z\Vy 1 

Vt yj Wax s 



(15.17) 



The variation AVt of (15.17) increases with a decreasing channel region 
and with lower doping concentrations. 

According to Mead and Conway [1] the failure probability of an inverter 
operated at the supply voltage of Vb can be stated to be 

Pl = e -2V B /AV T ' (15.18) 



The probability decreases with higher supply voltages and with a smaller 
scattering. The larger the transistor dimensions the smaller the spread of 
parameters. If the voltage can not be decreased the power dissipation can not 
be diminished as one would like. 

We illustrate this effect with the help of an inverter chain. The threshold 
voltage was lowered to Wo by implantation. Due to fluctuations of the particle 
density the threshold voltage varies for each transistor, see Fig. 15.9. If the 
supply voltage of the inverter chain is Vb all inverters with a threshold voltage 
Vt above Vb do not switch. The probability Pi for this condition follows from 
the shaded area of Fig. 15.9. The following numerical example is derived from 
Keyes: A MOS transistor with the dimension of (0.1 x 0.5) /irn 2 contains about 
1000 ± 35 doping atoms inside the space-charge region. This variation of the 
doping concentration results in a variation of the threshold voltage of ±17 mV. 
If a scattering of ±51 mV is permitted, 40 of the 1 million transistors operated 
at 2 V would fail and with it the circuit. 

From the probability of a failure the yield for the inverter chain can be 
stated to be 

y> = 1 — Pi. (15.19) 
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Fig. 15.9. Impact of the parameter spread on the yield and reliability: If the thresh- 
old voltage is above the power supply voltage the inverter does not work. The shaded 
area denotes the probability of a failure 



For an inverter chain with rn stages the yield is considerably smaller, 

Y st = 17 " = (1 - P T ) m . ( 15 . 20 ) 

If one wants to achieve a given yield, Pj has to become sufficiently small. 
According to Fig. 15.9 the supply voltage Vb has to be increased. The min- 
imum supply voltage is therefore not set by the nonlinear behavior of the 
characteristic curves but by parameter variations. Mead calculated this volt- 
age for a given FET technology and for 10 7 inverters to be 700 mV. The 
problem of a reduction of the yield becomes more and more important for 
devices with decreasing feature sizes. 

Although some experts see this problem as insuperable, engineers have 
found some answers to this problem, as stated in Chap. 2. The solution to 
this problem can be achieved by two approaches: The first approach wants 
to reduce the variation region by applying local active process steps. The 
second approach uses appropriate circuit techniques to avoid the influence 
of parameter variations to the switching performance. Alternatives are the 
following two approaches [67]: 

® The channel is implemented into an undoped layer that is grown epitaxially 
upon a highly doped layer. Inside the undoped layer no scattering occurs 
and the scattering in the doped layer has almost no effect on the spread 
of threshold voltage of the transistor. 

® A futuristic approach is a modified raster scanning microscope that can 
implant single atoms. However, the time for processing a whole wafer with 
a diameter of 30 cm is unrealistically long. 
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15.5 The Limits due to Thermal Particle Motion 

These stochastic processes result in temporal and spatial fluctuations. Param- 
eter variations are related to spatial fluctuations. Temporal fluctuations arise 
due to noise, the extension of a doped region, or due to the reliability. All 
these effects have a limiting effect on microelectronics. 



15.5.1 The Debye Length 



The diffusion of charge carriers can cause a potential difference inside a semi- 
conductor. As an example, an inhomogeneous doped semiconductor is inves- 
tigated. Due to the inhomogeneous doping profile diffusion of mobile charge 
carriers occurs resulting in an electrical field within the semiconductor. At the 
thermodynamic equilibrium the currents due to diffusion and due to the elec- 
tric field must be equal at each point within the semiconductor. This condition 
can be explained with the differential equation 

qD^=pqp p E. (15.21) 

The potential (j) of each point inside the semiconductor can be found by 

integrating (15.21). With it the voltage between two points can be calculated: 

W = (tfii - p 2 ) = Vt In—. (15.22) 

Pi 

This voltage is independent of the doping profile Na(x) within the semi- 
conductor. The potential along the semiconductor can be derived from the 
solution of the Poisson equation: 

= — q( p-N A ) (15.23) 

dx 2 £ 0 e 8i 

In the case of an intrinsic semiconductor and with p = n- t exp^/Vr) we 
get (15.24) after standardization: 




dx 2 




,V/V T _ M 

Tli 



(15.24) 



The term L® is the so-called Debye length. It determines the standardiza- 
tion of the x axis and can be stated to be 



L D = 



£o £siVr 
qrii 



(15.25) 



The lower the temperature and the smaller the relative permittivity e s i 
the smaller the Debye length Lj> The Debye length is a first indicator for 
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the minimal extension of solid-state devices. Also it determines the constant 
in the exponential term and describes the fading away of disturbances within 
the semiconductor. At room temperature and for silicon the Debye length is 
around 100 nm. An example of a disturbance is the spread of a space-charge 
region. According to Schottky a space-charge region ends abruptly. 

15.5.2 Thermal Noise 



The thermal motion results in an increased resistance and in a noise voltage 
that can be measured along a semiconductor. The noise voltage is usually 
derived from the harmonic oscillator, which is excited by the thermal energy. 
Such an oscillator is a model for a short-circuited line with the wave impedance 
Z — R. Here a simplified, quantitative derivation is shown. We calculate the 
thermal noise power at an ohmic resistor, which can be measured with a 
bandpass filter as shown in Fig. 15.10. 

According to the Nyquist theorem the signal must be sampled with At < 
1/(2 Af) to obtain a complete waveform. Af corresponds to the bandwidth 
of the bandpass filter. In the case of a noisy resistor we observe the power 
and not the voltage. From thermodynamics it follows that for each degree 
of freedom, and therefore for each amount of energy, the average energy of 
AE = 1/2 kT is related. From the viewpoint of information theory the mean 
transmissible power is 



P= 1 fc T 2 Af = kT Af. 



(15.26) 



The mean noise voltage at a noise-free resistor R can be calculated with 
the help of the four-pole theory according to 



v 2 = ARkT Af. 



(15.27) 




b) 




Fig. 15.10. Noise voltage of an ohmic resistor and corresponding equivalent elec- 
tronic circuit: Noise source and noiseless resistance 



The noise voltage is proportional to the resistance, the temperature, and 
the bandwidth. In general, this voltage has a negative effect for electronic 
circuits and particularly in the processing of analogous signals. A figure of 
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merit for the characterization of the noise voltage is the signal-to-noise ratio. 
In addition the noise voltage determines the lowest voltage inside an electronic 
circuit. If a circuit is scaled down the switching energy and therefore the 
signal power becomes lower. The effect of the thermal power increases with 
the bandwidth. 

Also the minimum switching energy due to thermal fluctuations has to be 
taken into account. This energy was estimated to be 50 fcT, see Chap. 9 and 
Chap. 3. 



15.6 Reliability as Limiting Factor 

The distribution of the parameters as shown in Fig. 15.9 is only valid for the 
initial state of the devices. Devices with a threshold voltage V't above Vb 
fail. As a result of this the device yield is reduced. During the operation of 
the devices the distribution widens, which in turn causes additional failures. 
These failures determine the reliability of the devices. The reliability describes 
the average time a device is able to work. Once again the diffusion process 
provides an adequate model for the explanation [68]. 

The diffusion process always occurs at temperatures above OK and in- 
creases with rising temperatures. The latter is advantageously used in the 
manufacture of integrated circuits. If a device is operated at working tem- 
perature the diffusion process is small enough that the lifetime of a device is 
hardly reduced. On the other hand, some failure mechanisms are based on the 
diffusion process. The lapse of time for this mechanism is very fast so further 
unwanted failures occur. 




Fig. 15.11. With increasing time t\ t ‘2 the failure rate increases. This is due to 
the enlargement of the distribution caused by the diffusion 
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Figure 15.11 is based on a normal distribution of the parameters that can 
be described by (15.28). Instead of x we take the normalized threshold volt- 
age. The distribution widens with time, so the number of failures increases. 
With (15.9) the time constant for this process can be approximated to 

Tz = d = c" e JW(*T). (15.28) 

In (15.28) xq is a fixed quantity. In the above-mentioned example xq corre- 
sponds to Vbo- If the diffusion constant D is replaced with the result of (3.37) 
we get the right term of (15.28). Once again C' is a constant and Ea the 
activation energy. 

Now we compare this time constant to the time constant that describes the 
failure rate in relation, to time. In the reliability theory a first approximation, 
of the failure rate is described by an exponential function: 

N = N 0 e~ xt . (15.29) 



Nq describes the number of devices at the initial state, N the number of 
devices during operation, t the time, and A the failure rate. This equation can 
be derived from the differential equation with the condition that the relative 
failure rate per unit time is constant: 



N dt 



(15.30) 



We now consider the term 1/A as a time constant, which can be propor- 
tionally set to tz- This can be done because the processes that cause failures 
are of equal nature to those that are responsible for the widening of the pa- 
rameter distribution with time. Equation (15.31) can be derived if A is set 
proportional to 1 /tz- 

A = (7,4 e E ^/ (kT K (15.31) 

If the diffusion process is taken as a basis we can. regard the failure rate A 
as a function of the diffusion constant D in which the activation energy Ea is 
the dominant quantity. Therefore the activation energy Ea is a characteristic 
measure in microelectronics for describing failure mechanisms. Ca includes 
different quantities. 

If the failure rates Ai and A 2 are measured at two different temperatures 
T\ and T 2 we get the activation energy Ea from (15.31). A graphical interpre- 
tation of this is shown in the so-called Arrhenius Plot of Fig. 15.12. 

Ea = kTi T,. ln^. (15.32) 

±2 — ±1 Ai 

Most of the device-failure mechanisms have an activation energy around 
1 eV. In practice usually more than one failure mechanism occurs at the same 
time. These failure mechanisms have different activation energies. A superpo- 
sition of different exponential functions can be measured. 
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7i T 2 T 

Fig. 15.12. Dependence of A of the temperature T and the activation energy Ea . 
Ea can be determined by two measurements with, different temperatures 



If (15.30) is converted into a difference equation 

AN = -XNAt (15.33) 

and if AN = 1 we get the time At that passes until a failure on the average 
will occur, 

At = (15.34) 

A device with the failure rate A stated in hours will fail on average in 1/ A 
h. The time until a failure in a system with N devices occurs is reduced with 
increasing N. 

The following simple example shows that a large number of devices com- 
plicates the guarantee of a long life time and the reliability. The mean time 
between failure (MTBF) which is set to 1/A is around 1 000 a for todays com- 
plex microelectronic circuits. In case of a digital clock with only one chip this 
number results in an outstandingly good reliability^. On the other hand a main- 
frame memory with 10 000 such components will fail on average each 0.1a 
due to a fault}^ component. For a computer system this is an unacceptable 
number. A remedy for this is an improved level of quality. 

Now we assume that each memory chip contains IQ 7 devices. If we assume 
an MTBF = 1000 a the MTBF of a single device is 1/A = IQ 10 a, see Fig. 15.13. 
This time corresponds more or less to the age of the universe. Even on this 
reliability level it is not simple for the manufacturer to determine the reliability 
of his devices. Without any insight into the failure mechanisms of the devices 
the reliability can not be improved any further. In addition, the experience of 
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tested devices in real-world application is of crucial importance for increasing 
the quality. 



number of elements 




Fig. 15.13. MTBF in hours or years and the number of devices on the module or 
chip level for different failure rates A. The markers correspond to the examples in 
the text 



A product has to function for many years. However, a lifetime above 10 
years is not worth aspiring to due to increased costs. On the other hand, 
failures that occur in the first year of operation are troublesome. The reliability 
decreases from the system level down to the device level. To find the correct 
values at each level is not an easy task. 

Redundancy is a good way to increase the reliability on the hardware 
level. In contrast to software, where the concept of redundancy can be easily 
fulfilled, additional control logic for switching-on the redundant parts is area 
consuming and the logic is also subject to failures. Therefore fault-tolerant 
architectures at higher integration levels are of great importance, especially 
for microelectronics of the next generation. 

In the regime of nanoelectronics the discussed problems might be some- 
thing different: The characteristics of quantum electronic devices are deter- 
mined by physical constants, which in turn do not scatter. In the inside of 
molecules no diffusion processes occur, so no reliability problem exists. On 
the other hand, the issue of reliability occurs when regarding compounds, 
necessary contacts, and supporting material. These problems are partly un- 
solved. 

The same context as for reliability is true for the time: The results of any 
computation should be computed in a few hundredths or tenths of a second. 
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A faster computation is of no interest for human beings but longer calculation 
times have an effect on the impatience and reduce the effectiveness of his work. 

In practice, the limits of thermodynamics are of great priority but for 
fundamental considerations physical limits are more expressive. 



15.7 Physical Limits 

The preceding effects of classical physics permit a derivation of physical 
limits of micro- and nanoelectronics. In the following discussion the most 
important limits from thermodynamics, the theory of relativity, and from 
quantum mechanics are presented. A graphical overview of these limits is 
shown in Fig. 15.14. For a better orientation the straight-line characteristic 
for IfJ = 10““ 15 W s is shown in the power-delay chart. This number is char- 
acteristic for current integrated circuits. The classical laws of physics that 
have been under discussion for a long time are not necessarily definite limits. 
Without any doubt they are important in the periphery of nanoelectronics. 

Some of these limits might be overcome in the future, the more as quantum- 
mechanical aspects are used. When regarding the quantum computer the re- 
sults of any computation are valid without any delay in time when applying 
the appropriate input signals and after a unitary transformation. Some people 
believe that a signal propagation as fast as the speed of light is possible in 
tunneling devices. Other theories suggest that information processing can be 
done with a negligible amount of energy. 

All these ideas should be taken, with caution because new concepts for 
computers such as those for the quantum, computer could affect the limits 
presented below. 
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Fig. 15.14. Physical limits for microelectronics with regard to power and delay 
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15.7.1 Thermodynamic Limits 



Thermal movement of charge carriers impedes the current flow in a switching 
device. The dependence of the current through a pn-j unction due to an applied 
voltage is reduced at higher temperatures. The higher the temperature the 
smaller the curvature of the diode characteristic. If we want to take advantage 
of the nonlinear behavior of the device the working voltage should not be too 
small. The smallest value of this voltage is 

Vmin = r V T = r k T j g, (15.35) 

with r — 4 — 8. The higher the temperature T the higher the voltage V m i n . 

To reduce scattering inside the device during a switching event dozens of 
electrons should be involved. Therefore the lower limit for this value Ws is 
about 25 kT. Actually this value is mostly too small, so thermal fluctuations 
would occur inside the device. 

In Sect. 9.1 the effect of thermal fluctuations on switching devices was 
discussed. We derived the probability Pth that energy values due to thermal 
fluctuations are above the switching energy Ws- In the following we define the 
failure rate that is small enough for real applications. From this failure rate 
the lowest possible switching energy can be derived if the operation time to 
and the switching time ts are taken into account. From the number of possible 
switching processes n = tofts and with P t h we get the failure rate: 

Rth = hp th= t ± e -Ws/(kT)_ (15 . 36) 

t s is 

Now we assume 10 6 switching devices with a switching time ts of 1 ns. If 
only 1 failure per year (1 a = 31.5 x 10 6 s) is allowed Ws has to be above 50 k T. 
This value can be stated for the thermodynamic limit shown in Fig. 15.14 [69]. 

Another thermodynamic limit is a result of diffusion. On the one hand 
this effect is applied in the manufacturing of integrated circuits, on the other 
hand, diffusion occurs during the application of the devices, which causes a 
change in some electrical parameters. These changes can be large enough that 
the device will fail over its lifetime. In addition, the supply of energy and heat 
removal sets further limits. 



15.7.2 Relativistic Limits 

Some important limiting effects for microelectronics can be derived from the 
theory of relativity. If information is coupled to discrete energy quantities we 
must assume that these quantities can not propagate faster than the speed of 
light c. The switching times can not be reduced below a certain limit because 
the devices can not be scaled down as one would like. Let us assume that the 
active area of classical devices can not be smaller than the Debye length, then 
we get, with (15.25), a signal transfer time tj, ------ 1 fs. This value is also shown in 
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Fig. 15.14. From this idea we can also derive that the core of a microprocessor 
for future supercomputers can maximally have the volume of a sphere with 
the radius of 2.5 cm. This size follows from the fact that a light wave can cover 
the distance of 3 cm in 0.1 ns, which corresponds to a frequency of 10 GHz. 

Another limiting effect for microelectronic devices is the wave impedance of 
a transmission line, which can not be larger than Zq = 377 O. This finite wave 
impedance and the lowest possible voltage level V m i n ------ rVr (derived from 

thermodynamics) cause a power dissipation in the terminating impedances 
which can not be further reduced. With (2.11) we can calculate the power 
dissipation as 



PVL ~ 



V 2 



£ock 2 2 

2~ T r 

q2 



(15.37) 



Equation (15.37) depends only on the temperature T, which is also de- 
picted in Fig. 15.14. In microelectronics the values for the signal delay time, 
the wave impedance, and the power dissipation belong to the relativistic lim- 
its. 



15.7.3 Quantum-Mechanical Limits 

The interaction of light with a solid can only be explained by quantum me- 
chanics. Regarding (3.6) in the case of n = 1 we get the energy per switching 
event according to 

W s >—. ( 15 . 38 ) 

t s 

Or for the power: 

Pv > A ( 15 . 39 ) 

l S 

These values can not be reduced any further. They are termed the 
quantum-mechanical limit in Fig. 15.14. 

In the same way, failures of switches occur due to thermal fluctuations, 
failures due to tunneling of charge carriers inside the switches is possible. 
Tunneling is an effect of quantum mechanics. In the following a rough estima- 
tion of this type of failure is given. 



15.7.4 Equal Failure Rates by Tunneling and Thermal Noise 

The derivation of the failure probability due to thermal fluctuations assumed 
that a failure occurs each time the energy of a charge carrier is above Ws- 
If we consider failures by tunneling, the probability that a charge carrier can 
tunnel through a barrier is dependent on the energy Ws of the barrier and 
on the depth d . A first approximation for this probability is shown in (12.1) 
with P tu = D. E is the mean kinetic energy of the electrons. 
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Now we estimate the probability that failures due to tunneling and thermal 
fluctuations equal. With the assumption P t h = Ptu and with (3.30) and (9.2) 
we get: 

= M^miWs-E). (15.40) 

If the mean kinetic energy E of the electron is half the energy Ws of the 
barrier we get the first-order estimation: 



_ h^Ws/m 
2 kT 



(15.41) 



If we assume Ws = 50 kT the depth of the barrier can be calculated as 
d — 2 nm at T = 300 K. At this depth the failures caused by tunneling equal 
those caused by thermal fluctuations. Therefore the dimension of a device is 
a limit for the microminiaturization of classical devices. This limit is almost 
reached by semiconductor memories where the oxide thickness is around 5 nm. 
The charge flow due to tunneling has to be taken into account. 



15,8 Summary 



The drift current because of an electric field and the diffusion current because 
of density differences are fundamental effects when the motion of particles is 
investigated. They are not only responsible for the switching process but also 
for the reliability of the devices. Due to thermal motion the particle concen- 
tration is not constant, which causes variations of the electrical characteristics 
of the devices. When considering nanoelectronics the problems are of a differ- 
ent nature. The universal constants do not scatter and the molecules do not 
move. But the wires and the supporting material are subject to the process of 
diffusion. Reliability is a very import characteristic especially for integrated 
systems in the regime of nanoelectronics. 

The diffusion process also determines the heat conduction, which is ele- 
mentary for electronics. With the help of simple models some practical limits 
have been derived. 

In addition the physical limits are important for principle considerations. 
They define the scope for future development of classical electronics. At the 
moment, integrated electronics are far from these limits. 
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Final Objectives of Integrated Electronic 
Systems 



A book about nanoelectronics should present visions of integrating the infor- 
mation processing machines into nanoelectronics. The main objectives of the 
effort in the development of integrated electronics are systems for informa- 
tion processing without doubt. From our present view such machines must be 
as small as possible to gain new information, or to remove uncertainty in our 
knowledge [70]. These tasks should be fulfilled with as low a power as possible. 



16.1 Removal of Uncertainties by N anomachines 

The main target of information technology is the removal of uncertainties in 
our knowledge, i. e. to be well informed. Uncertainty is correlated to informa- 
tion because information is described in terms of the probability of an event. 
The uncertainty and therefore the probability are caused by a spatial event or 
are due to complexity. In the first case we can not directly observe an event, 
whereas in the second case the selection on a specific event is impossible due 
to the diversity of possible events. One task of information technology is to 
provide people with appropriate machines for removing these uncertainties. 
We should not forget that the semantic information is the relevant one for us 
whereas the technical sj^stem only handles numerical information, semantic 
information does not figure at all. 

On the one hand people have a strong demand for exchanging information 
with other people at any place, and any time. The satisfaction of these de- 
mands leads to world-wide networks and portable machines, e.g. laptops and 
cellular phones with wireless connections to the communication networks. For 
all these systems electronics have to be lightweight, small, and power saving. 
This development leads to the terms of ubiquitous computing and ubiquitous 
networking . 

In addition, there are demands of the human society to process informa- 
tion. First, mankind has to solve important complex problems such as the rate 
of economic development in future, to consider environmental pollution or to 
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forecast the weather more exactly than today, to organize the health services 
more efficiently, to note only some examples. All these problems are character- 
ized by though mathematical tasks that are inherent to them. Therefore they 
can only be solved by high computational power, which present computers do 
not provide. 

Whereas up to now 7 the development of microelectronics has pushed the 
information technology to better and better performances, the further devel- 
opment of microelectronics will be determined hy the needs of the market. 
The former case is called forward integration, which means that microelec- 
tronics intruded into systems or markets, w^ere controlled by technology, i.e. 
the integration of microelectronics in systems. From now we get a backward 
integration that yields a wide variety of applications such as voice-controlled 
computers, image-processing systems, intelligent data mining, automatic pro- 
gramming tools, design boxes, and regulating systems for complex processes, 
etc. 
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| Protected Enviroment | 

Pattern Recognition 

Process Control 



| Reai Goais 
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Fig. 16 . 1 . Goals of the information technology: Today w r e do not have an exact 
view 7 of the complexity so the significance of forward-looking microelectronics is not 
obvious, FL Martini 



Today our forecasts concerning the nanoelectronics are relatively precise, 
however, the development is just at its beginning (Fig. 16.1). The future aims 
at a kind of information processing where the information comes from the 
real world environment containing interferences and not from artificial, and 
protected systems. Neural nets of inferior animals show a behavior such as 
this. Therefore biologically inspired learning will influence new^ techniques for 
improved information processing. To take living things as an example seems to 
be reasonable since the techniques from nature may provide us with machines, 
e.g. robots, with similar features. 



16.2 Uncertainties in Nanosystems 
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One alternative leads to nanomachines based on the concept of the Turing 
machine. An interesting but more or less academic example is the DNA com- 
puter with nanomachines. Further alternatives are machines based on compu- 
tational intelligence, e.g. the artificial neural networks. All of these machines 
need an efficient communication system between the single nanomachines or 
neurons, and a powerful communication system to the outside world. In addi- 
tion these nanosystems should make use of self-structuring and self-assembling 
so the designer can concentrate on. higher levels in the design flow. This step 
if of high importance if we deal with high system complexities. 

The present strategies for designing systems are dominated by the ideas 
of microelectronics: Miniaturization of classical circuits yielding higher per- 
formances. Only this way will not be sufficient for nanoelectronics. Due to the 
large complexity of the structures we have to develop new methods for design- 
ing systems. The design methods must be compatible with the technologies. It 
is still open whether fundamentally new concepts as such quantum computing 
will come to fruition, and result in different applications. 

Today there are some speculations that nanoelectronics is able to produce 
intelligent robots that are superior to human beings. Therefore some people 
see the risk of being enslaved by these robots. However, we have no ideas 
for most characteristics of men so the question of being enslaved by robots 
is still open. The future research activities will show if machines will have a 
consciousness and if they can be an intelligent autonomous entity. 



16.2 Uncertainties in Nanosystems 

Probability is inherent to information. Additionally, it is also a characteristic 
feature of hardware, and in general for software, too, if it is complex and 
inscrutable, see Fig. 16.2. The implementation of logic functions into software 
could be made more secure and more reliable than in hardware since only 
mathematical context has to be transferred. However, reality shows a reverse 
behavior due to shortcomings and errors in computer programs. 

Numerical information can always be transformed in an equivalent net- 
work composed of single switches. These switches can be implemented in sev- 
eral physical structures. Because of economical reasons certain functions are 
implemented in software. This concept is an essential strategy for the imple- 
mentation of present systems. 

Uncertainty plays a crucial role for the behavior of switches. If these 
switches are based on many-particle systems the switching behavior is sub- 
ject to distribution functions of the particle energy, normally the Boltzmann or 
Fermi distribution. Since the distribution function reaches to infinity, thermal 
energy can cause errors in the switches of a network. Therefore all switches 
will not function in an absolutely safe manner. The same uncertainty is applies 
to a switch in a one-particle system. In this case, the Schrodinger equation 
describes the behavior of a particle. Once again the material waves extend to 
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Fig. 16.2. Effects of uncertainties in an electronic system. The objective of these 
nanomachines is to remove uncertainties from our knowledge 



infinity, and tunneling of particles can cause errors and diminish the reliability. 
The faulty behavior of the switches is usually compensated by the application 
of redundancy concepts, especially, these done in software. 

The implementation of switches in physical structures causes reliability 
problems that intensively occur on the system level. The reliability is propor- 
tional to the probability, which indicates that a machine will work correctly 
during a certain time period, the so-called lifetime of a machine. Currently 
the demand for a long lifetime is achieved by careful selection of the physi- 
cal components or switches with regard to their functionality and by special 
strategies in the development, for example by introducing redundant parts 
into the nanosystem. These problems increase with decreasing feature size of 
the devices because of the uncertain behavior of the switches. In addition, the 
number of failures grows exponentially with the number of switches or the 
complexity of the systems, as shown in Chap. 15. 

In future, engineers will have to learn how 7 to build systems that are fault 
tolerant in spite of the presence of some defective devices. This is one rea- 
son why nanoelectronics needs fault-tolerant architectures. Artificial neural 
networks are one possible solution to overcome this problem. The implicit 
programming of such neural networks draws no conclusion of the network 
itself, i.e. the fault tolerance is not deterministic. Because of this other un- 
expected errors can occur. Therefore one challenge is to find an architecture 
that yields a perfect information-processing machine. The overall challenge 
is to build machines that should reliably remove uncertainties, whereas these 
machines themselves contain uncertain effects. 



16.3 Uncertainties in the Development of 
Nanoelectronics 

One main goal in the development of hardware is the realization of information- 
processing machines that are as small as possible, as mentioned above. The 
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reasons for this strategy are high reliability, low costs, low power consumption, 
high performance, small volumes, and high processing speed. The presentation 
in Chap. 2 shows the limits of todays hardware implementations. 




Fig. 16.3. Various ways for the development of electronic systems towards nano- 
electronics 



Today we have a vision of the concept for an information-processing system 
in nanoelectronics. Devices that are controlled by a few electrons or molecules 
correspond to single bits. Electrons are moving in molecular structures that 
are built by self-organization. This concept uses the same laws for the function 
of the devices as for the structuring of the nanocircuitries. At the moment this 
level of development is a vision and will not be achieved by technology soon. 

This development can be done in many different ways, as Fig. 16.3 depicts. 
The key words in this scenario concern both the technology and the system 
architecture, and they have been explained in the preceding chapters. The dis- 
tances between the ellipses concern the potential of development. For CMOS 
technology the potential will be exhausted soon, whereas nanotechnologies 
offer a wide field with really great progress. 



16.4 Summary 

The development of information systems has to be continued by several orders 
of magnitude as current forecasts predict. This goal can be met by introducing 
nanoelectronics. Nanoelectronics offer the potential of solving very complex 
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problems by use of efficient parallel computing. In addition, nanoelectron- 
ics may offer new technologies that will substitute the very expensive silicon 
technology on a long-term basis. 

The challenge of nanosystems concentrates on two fields: First, process 
engineers have to develop technologies that allow the manufacturing of mi- 
crosystems with, nanostructures at high yields, and also with a high reliabil- 
ity. Secondly, system engineers have to develop adequate architectures for ten 
billion or even more devices to provide an innovative information processing. 
Both problems are not solved today. 

Without any doubt forecasts are questionable: The Nobel prize winning 
physicist Niels Bohr remarked: Forecasts are always difficult, especially those 
about the future. However, in the fast-growing field of electronics the remark 
of Noyce, the cofounder of Intel, is more relevant: It is much easier to forecast 
the future than it is to change the past [71]. 
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